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AAGCTTGGAT ATTGATCACA TGATGGAGGT GATGGAAGCA TCTAAGTCTG CAGCGGGGTC 
GGCGTGCCCA AGTCCGCAGG CTTATCAGGC AGCTTTTGAG GGAGCTGAGA 
ACATTATCGT TGTGACGATT ACAGG'tGGGC TATCGGGTAG TTTTAATGCG GCACGTGTAG 
CTAGGGATAT GTATATCGAA GAGCATCCGA ATGTCAATAT CCATTTGATA 
GATAGTTTGT CAGCCAGTGG GGAAATGGAT TTACTTGTAC ACCAAATCAA TCGCTTAATT 
AGTGCAGGAT TAGATTTTCC ACAAGTAGTA GAAGCGATAA CTCACTATCG 
GGAACACAGT AAGCTCCTCT TTGTTTTAGC GAAAGTTGAT AATCTTGTTA AGAAT GGAAG 
ACT G AGC AAA TTGGTAGGCA CTGTCGTTGG TCTTCTCAAT ATCCGTATGG 
TTGGTGAGGC AAGTGCTGAA GGAAAATTAG AGTTGCTTCA AAAGGCGCGT GGTCATAAGA 
AATCTGTGAC AGCAGCCTTT GAAGAAATGA AAAAAGCAGG CTAXGATGGT 
GGTCGAATTG TTATGGCCCA CCGCAACAAT GCTAAGTTCT TCCAACAATT CTCAGAGTTG 
GTAAAAGCAA GTTTTCCAAC GGCTGTTATT GACGAAGTTG CAACATCAGG 
TCTATGCAGT TTTTATGCTG AAGAAGGTGG ACTTTTGATG GGCTACGAAG TGAAAGCGTG 
ATTCACAGAG TAATAATTTT GGGCTGTAAT TTCCGCTATA GAATAATCCC 
CCTCTTCTTC TAAGTTCGAG GGGGATTGTT TGTATGAGAC TATTGGATTT CATTCATTCA 
AATATCTTAC GAATTGCTCC AGTTTATCTG CAAAATCTTG TTCAAAGAAG 
ATCTGTAAGA AATCAGCTTT CTGTCCGCTG AAATAATAAC ATTTTCCAAA CATGTGTTGG 
ATGCTAGGAG AAAGAATCCC CTTGCTTAGC TGAAAGGTCA CGCTCCCCTT 
TGGAATTCGA TACGGGATGT TTAAAGCGTA TTTCTCTAGA CAGTCTTTTA TTTTATTCCA 
TTGAGCGTGA TAAATGTGAT GAAGATGCTG TGTGTTCCGC GCAAACATAC 
CGTTATCAAT GTAGAGCGAG AGAGCTTTTT GCATGATAAG ATTGGTATCG TAGTCGATTA 
GACTCTTATG TTTGATGAAG ATATCACGTA GCTGATTAGG AAGGCTGATT 
GCR.CCGATTC GGAGGGCAGG AAAGAGTGTC GGTGTAAAAG ATTTTATATA GATGACGCGA 
TTATCTGTAT CAAGATAGTG TAAAGGTAGG CTATGACTAG AGTCGAAATC 
TGCTAAATAG TCATCCTCAA TGATGTAGAC ATCGTATTGC TTTGCTAATT TTACGATGGC 
TGTTTTTGTT G CTATATC AT AGGTTGAACC GAGAGGGTTG TGCAAGCGAG 
GAATTGTGTA G AAAAACT T A ATTTTTCCAG TTTGGAAGAT ACTTTCCAAT TCTTCTAGGT 
CAATTCCATC TAAATTCCGT TCAATTGTTT GATAGGGGAT TCCTTGATGT 
CGAATGAGCT CTATCATTCG TGAATAGGTA GGGTTCTCTA TCAAGATTTC CGTTTTTCCA 
GCCAAGGTTT CCATTTGTGT GAGAATATAT AGAGCTTGTT GACTACCAGC 
TGTGATAACC AGCTGGTCTT TTTTTGTATA GACATGATAG TCCATTAACA GACTTTGAAC 
GGAGGAAATC AATTCTGCCA ATCCCTCTTG CTGGTGATAG TAGTTGAATA 
GGTAATTTTC CCGCCCAATA AGACTTTCTT TTAGACAAAT CCGAAAATCT TCATAGGTAA 
TTCTTGAAAG TCTGTAGGAT TGAGCTCTAC AGGTATGGTC TTG GAAATCT 
CTATCCTCTA AGATATAATA ACCGCTTTTT TCGACAGCGT AGATCTTATT TTGGTATTTT 
AATTCCAACA TAGCCTTTTG GACAGTGTCT TTGCTACAAT GATATTGCTC 
GCGGAGTTGA CGGATAGAAG GTAATTTCTC TCCACGTTTG AATCGATGTT CCTCTATTCC 
AGT CAAAAT A TCTTGGATGA TAACTTGATA TTTTTTCATC TAGGTCCCCT 
TTTTTATAGA CTATGTTACT AGCTAGTATA TAGAAAAAAT TGAAGAAAGA CAATATATGA 
ATAATGGGGT TGAGGTTCAG GAATTAAGCT ACTCTATGGT ATAATTAAGT 
GATGAAAATA ATTATACCTA ATGCAAAAGA AGTAAATACA AATCTAGAGA ATGCCTCGTT 
TTATCTCCTG TCTGATCGAA GCAAGCCGGT GCTGGATGCC ATAAGTCAAT 
TTGATGTAAA AAAGATGGCT GCCTTTTATA AATTGAATGA AGCAAAGGCT GAGTTAGAAG 
CTGACCGTTG GTATCGAATC AGGACAGGTC AAGCAAAAAC CTATCCAGCC 
TGGCAGTTAT ATGATGGTCT CATGTATCGT TATATGGATA GGCGAGGTAT AGATT CGAAA 
GAAGAAAATT ATTTACGTGA CCACGTTCGT GTAGCGACAG CCTTATACGG 
ATTGATTCAT CCTTTTGAAT TCATTTCACC TCACCGCTTA GATTTTCAAG GGAGCTTAAA 
GATAGGCAAT CAGTCTTTGA AACAGTACTG GCGACCGTAT TATGACCAAG 
AAGTTGGTGA TGATGAACTG ATTCTCTCAC TGGCTTCGTC AGAATTTGAG CAGGTGTTTT 
CTCCCCAGAT T C AG AAAAGA TTAGTTAAAA TTCTTTTCAT GGAAGAAAAA 
GCAGGTCAGC TAAAAGTTCA CTCGACTATA TCAAAAAAAG GCAGAGGAAG ATTGCTGTCC 
TGGTTGGCTA AGAACAATAT TCAGGAATTA TCGGACATTC AAGATTTTAA 
GGTGGATGGC TTTGAATATT GTACTTCCGA ATCAACGGCA AACCAACTTA CCTTCATACG 
ATCAATAAAA ATGTGAAATT ATGAAAAAGA TAACGTTTTC CAGCGCTAAA 
AAGGGTAGAA AAATATTAAT TTCTATGATA TAATGGATGC GTTATAGGTA AAAGTCTAGG 
AAGGTTGTTT ATGAAAAAGA G AAGCGG AC G AAGTAAGTCG TCCAAGTTCA 
AATTGGTAAA TTTTGCGCTT TTGGGACTTT ATTCCATTAC TCTATGTTTG TTCTTAGTGA 
CCATGTATCG CTATAACATC CTAGATTTCC GGTATTTAAA CTATATTGTG 
ACGCTTTTGC TAGTAGGAGT GGCAGTATTG GCTGGATTAT TGATGTGGCG TAAGAAAGCG 
CGCATATTTA CAGCGCTCTT ACTTGTTTTT TCACTGGTCA TCACGTCTGT ' 
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TGGGATCTAT GGAATGCAAG AAGTTGTAAA ATTTTCAACA CGACTAAATT CAAATTCGAC 
ATTTTCAGAA TATGAAATGA GTATCCTTGT CCCAGCAAAT AGTGATATTA 
CGGACGTTCG TCAGCTTACT AGTATCCTTG CTCCAGCCGA ATACGACCAA GATAACATCA 
CCGCTTTATT GGATGACATA TCCAAAATGG AATCTACTCA ACTAGCAACT 
AGCCCCGGGA CTTCTTACCT GACAGCATAT CAATCTATGT TGAATGGCGA GAGTCAAGCG 
ATGGTGTTCA ACGGAGTTTT TACCAATATT TTAGAAAATG AAGATCCAGG 
CTTTTCTTCA AAAGTGAAAA AAATATATAG TTTCAAAGTG ACTCAGACTG TTGAAACAGC 
TACTAAGCAG GTGAGTGGAG ATAGCTTTAA TATCTATATT AGTGGTATTG 
ATGCTTATGG ACCGATTTCT ACGGTCTCTC GTTCAGATGT CAATATCATT ATGACTGTCA 
ATCGTGCGAC ACATAAGATT TTATTGACAA CTACTCCACG AGATTCATAC 
GTTGCTTTCG CAGATGGCGG GCAAAATCAA TACGATAAAC TAACACATGC TGGTATTTAC 
GGTGTCAATG CTTCTGTGCA CACCTTAGAA AATTTTTATG GGATTGACAT 
TAGCAATTAT GTGCGGTTGA ACTTCATTTC CTTCCTTCAA TTAATCGACT TGGTGGGTGG 
AATTGATGTA TATAACGATC AAGAATTTAC AAGTTTACAT GGGAATTATC 
ATTTCCCTGT TGGACAAGTT CATTTAAACT CAGACCAAGC ATTAGGCTTC GTTCGAGAGC 
GCTACTCTTT AACAGGGGGT GACAATGACC GTGGTAAAAA CCAGGAAAAA 
GTGATTGCTG CCTTGATTAA AAAGATGAGT ACGCCAGAGA ATCTAAAAAA TTACCAGGCA 
ATCCTATCTG GATTGGAAGG CTCAATTCAA ACGGATTTGA GCTTAGAAAC 
GATTATGAGT TTAGTGAATA CCCAACTAGA ATCAGGAACA CAATTTACAG TAGAGTCACA 
AGCATTGACA GGAACAGGAC GCTCAGACTT ATCTTCTTAT GCGATGCCTG 
GATCACAACT TTATATGATG GAAATTAACC AAGATAGTCT GGAGCAATCA AAGGCAGCGA 
TTCAGTCCGT ACTTGTTGAA AAATAAAGAT TTTAGGAGAA AATATGAACA 
ATCAAGAAGT AAATGCAATC GAAATCGATG TTTTATTCTT ACTAAAAACA ATTTGGAGAA 
AGAAATTTTT AATTCTCTTA ACTGCAGTGT TGACTGCGGG GTTGGCATTT 
GTCTACAGTA GTTTTTTAGT GACACCTCAA TATGACTCCA CTACCCGTAT CTATGTAGTG 
AGTCAAAATG TTGAAGCCGG TGCGGGCTTG ACTAACCAAG AGTTACAAGC 
GGGTACCTAT T7GGCAAAAG ACTATCGGGA AATTATCCTA TCACAAGATG TATTGACACA 
AGTAGCAACG GAATTGAATC TGAAAGAGAG TTTGAAAGAA AAAATATCAG 
TTTCTATTCC TGTTGATACT CGTATCGTTT CTATTTCTGT GCGTGATGCG GATCCAAATG 
AAGCGGCACG TATTGCAAAT AGCCTTCGCA CCTTTGCAGT GCAAAAGGTT 
GTTGAGGTCA CCAAGGTAAG CGATGTGACG ACACTTGAAG AAGCAGTCCC AGCGGAAGAA 
CCAACCACTC CAAATACAAA ACGAAATATC TTGCTTGGTT TATTAGCTGG 
AGGTATCTTG GCAACAGGTC TTGTACTGGT TATGGAGGTT TTGGATGACC GTGTAAAACG 
TCCTCAGGAC ATCGAAGAGG TAATGGGATT GACATTGCTA GGTATAGTAC 
CAGATTCGAA GAAATTAAAA TAGGAGAACA ATATGGCGAT GTTAGAAATT GCACGTACAA 
AAAGAGAGGG AG T AAAT AAA ACCGAGGAGT ATTTCAATGC TATCCGTACC 
AATATTCAGC TTAGCGGAGC AGATATTAAG GTTGTTGGTA TTACCTCTGT TAAATCGAAT 
GAAGGTAAGA GTACAACTGC GGCTAGTCTC GCTATTGCCT ATGCTCGTTC 
AGGTTATAAG ACCGTCTTGG TGGATGCAGA TATCCGAAAT TCAGTCATGC CTGGTTTCTT 
CAAGCCAATT ACAAAGATTA CAGGTTTGAC GGATTACCTA GCAGGGACAA 
CAGACTTGTC TCAAGGATTA TGCGATACAG ATATTCCAAA CTTGACCGTA ATTGAGTCAG 
GAAAGGTTTC TCCCAACCCT ACTGCCCTTT TACAAAGTAA GAATTTTGAA 
AATCTACTTG CGACTCTTCG TCGCTATTAT GATTATGTTA TCGTTGACTG TCCACCATTA 
GGACTGGTAA TTGATGCAGC TATCATTGCA CAAAAATGTG ATGCGATGGT 
TGCAGTAGTA GAAGCAGGCA ATGTTAAGTG CTCATCTTTG AAAAAAGTAA AAGAGCAGTT 
GGAACAAACA GGCACACCGT TCTTAGGCGT TATCTTGAAC AAATATGATA 
TTGCCACTGA GAAGTATAGT GAATACGGAA ATTACGGCAA AAAAGCCTAA TTTCTCAGAT 
AACATAAGTT TGATAAG TAG GTATTAATAT GATTGATATC CATTCGCATA 
T CAT ATT TGG TGTGGATGAC GGTCCCAAAA CTATTGAAGA GAGCCTGAGT TTGATAAGCG 
AAGCTTATCG TCAAGGTGTT CGCTATATCG TAGCGACATC TCATAGACGA 
AAAGGGATGT TTGAAACACC AGAAAAAATC ATCATGATTA ACTTTCTTCA ACTTAAAGAG 
GC AG TAGCAG AAGTTTATCC TGAAATACGA TTGTGCTATG GTGCTGAATT 
GTATTATAGT AAAGATATCT TAAGCAAACT TGAAAAAAAG AAAGTACCAA CACTTAATGG 
CTCGTGCTAT ATTCTCTTGG AGTTCAGTAC GGATACTCCT TGGAAAGAGA 
TTCAAGAAGC AGTGAACGAA ATGACGCTAC TTGGGCTAAC TCCCGTACTT GCCCATATAG 
AGCGTTATGA TGCTCTGGCA TTTCAGTCAG AGAGAGTAGA AAAGCTAATT 
GACAAGGGAT GCTACACTCA GGTAAATAGT AACCATGTGT TGAAGCCTGC TTTAATTGGC 
GAACGAGCAA AAGAATTTAA AAAACGTACT CGATATTTTT TAGAGCAGGA 
TTTAGTACAT TGTGTTGCTA GCGATATGCA TAATTTATAT AGTAGACCTC CGTTTATGAG 
GGAGGCGTAT CAGCTTGTAA AAAAAGAGTA TGGTGAGGAT AGAGCGAAGG 
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CTTTGTTCAA GAAAAATCCT TTGTTGATAT TGAAAAATCA AGTACAGTAA CCTCATAGAA 
ATAGTGGAGG AGCTATGAAT ATTGAAATAG GATATCGCCA AACGAAATTG 
GCATTGTTTG ATATGATAGC AGTTACGATT TCTGCAATCT TAACAAGTCA TATACCAAAT 
GCTGATTTAA ATCGTTCTGG AATTTTTATC ATAATGATGG TTCATTATTT 
TGCATTTTTT ATATCTCGTA TGCCGGTTGA ATTTGAGTAT AGAGGTAATC TGATAGAGTT 
TGAAAAAACA TTTAACTATA GTATAATATT TGTAATTTTT CTTATGGCAG 
TTTCATTTAT GTTAGAGAAT AATTTCGCAC TTTCAAGACG TGGTGCCGTG TATTTCACAT 
TAATAAACTT CGTTTTGGTA TACCTATTTA ACGTAATTAT TAAGCAGTTT 
AAGGATAGCT TTCTATTTTC GACAACC TAT CAAAAAAAGA CGATTCTAAT TACAACGGCT 
GAAC TATGGG AAAA.TATGCA AGTTTTATTT GAATCAGATA TACTATTTCA 
AAAAAATCTT GTTGCATTGG TAATTTTAGG TACAGAAATA GATAAAATTA ATTTACCATT 
ACCGCTCTAT TATTCTGTTG AAGAAGCTAT AGGGTTTTCA ACAAGGGAAG 
TGGTCGACTA CGTCTTTATA AATTTACCAA GTGAATATTT TGACT TAAAG CAATTAGTTT 
CAGACTTTGA GTTGTTAGGT ATTGATGTAG GCGTTGATAT TAATTCATTC 
GGTTTTACTG T G T TGAAGAA TAAAAAAATC CAAATGCTAG GTGAC CATAG CATCGTCACT 
TTTTCCACAA ATTTTTATAA GCCTAGTCAC ATCTGGATGA AACG ACT TT T 
AGATATACTT GGAGCAGTAG TCGGGTTAAT TATTAGTGGT ATAGTTTCTA TTTTGTTAAT 
TCCAATTATT CGTAGAGATG GTGGGCCAGC CATTTTTGCT CAGAAACGAG 
TTGGACAGAA TGGACGCATA TTTACATTCT ACAAGTTTCG TTCGATGTTT GTTGATGCCG 
AGGTACGTAA GAAAGAAT T A ATGGCTCAAA ACCAGATGCA AGGTGGGATG 
TTCAAAATGG ACAACGATCC TAGAATTACT CCAATTGGAC ACTTCATACG AAAAACAAGT 
TTAGATGAGT TACCACAATT TTATAATGTT CTAATTGGAG AT ATGAGT CT 
AGTCGGTACC CGTCCGCCTA CAGTTGATGA ATTTGAAAAA TATACTCCTA GTCAAAAGAG 
AAGATTGAGT TTTAAACCAG GGATTACAGG TCTTTGGCAA GTGAGCGGAA 
GAAGTGATAT CACAGATTTT AATGAAGTCG TTAGGCTGGA CCTAACATAC ATTGATAATT 
GGACCATCTG GTCAGACATT AAGATTTTAT TGAAGACAGT GAAAGTTGTA 
TTGTTGAGAG AGGGAGGTCA GTAAGACTCC TTTAAAACAA AGAATAGTAG TAGGGGATAT 
GAGAACAGTT TATATTATTG GTTCAAAAGG AATACCAGCA AAGTATGGTG 
GTTTCGAGAC TTTCGTAGAA AAATTAACTG AGTATCAGAA AGATAAATCA ATTAATTATT 
TTGTTGCATG TACAAGAGAA AAT TCAGCAA AAT C AG AT AT TACAGGAGAA 
GTTTTTGAAC ATAATGGAGC AACATGTTTT AATATTGATG TGCCAAATAT TGGTTCAGCA 
AAAGCCATTG TTTATGATAT TATGGCTCTC AAGAAATCTA TTGAAATTGC 
CAAAGATAGA AATGATACCT CTCCAATTTT CTACATTCTT GCTTGTCGGA TTGGTCCTTT 
CATTTATCTT TTTAAGAAGC AGATTGAATC AATTGGAGGT CAACTTTTCG 
TAAACCCAGA CGGTCATGAA TGGCTACGTG AAAAGTGGAG TTATCCCGTC CGACAGTATT 
GGAAATTTTC TGAGAGTTTG ATGTTAAAAT ACGCTGATTT ACTAATTTGT 
GATAGCAAAA ATATTGAAAA ATATATTCAT GAAGATTATC GAAAATATGC TCCTGAAACA 
TCTTATATTG CTTATGGAAC AGACTTAGAT AAATCACGCC TTTCTCCGAC 
AGATAGTGTA GTACGTGAGT GGTATAAGGA GAAGGAAATT TCAGAAAATG ATTACTATTT 
GGTTGTTGGA CGATTTGTGC CTGAAAATAA CTATGAAGTA ATGATTCGAG 
AGTTTATGAA ATCATAT TCA AGAAAAGATT TTGTTTTGAT AACGAATGTA GAGCATAATT 
CCTTTTATGA G AAAT TGAAA AAAGAAACAG GGTTCGATAA AGATAAGCGT 
ATAAAGTTTG TTGGAACAGT CTATAATCAG GAGCTGTTAA AATATATTCG TGAAAATGCA 
TTTGCTTATT TTCATGGTCA CGAGGTTGGA GGAACGAACC CATCTTTACT 
TGAAGCACTT TCTTCTACTA AACTAAATCT TCTTCTAGAT GTGGGCTTTA ATAGAGAAGT 
AGGGGAAGAA GGAGCGAAAT ACT GG AAT AA AGATAATCTT CACAGAGTTA 
TTGACAGTTG TGAGCAATTA TCACAAGAAC AAATTAATGA TATGGATAGT TTATCAACAA 
AACAAGTCAA AG AAAG AT T T TCTTGGGATT TTATTGTTGA TGAGTATGAG 
AAGTTGTTTA AAGGATAAGT TATGAAAAAG ATTCTATATC TCCATGCTGG AGCAGAATTA 
TATGGGGCAG ATAAGGTTCT CTTGGAACTT ATAAAAGGCT TAGATAAGAA 
TGAATTTGAA GCGCATGTTA TCCTACCTAA TGATGGAGTC CTAGTGCCAG CAT TAAG AG A 
AGTTGGTGCG C AAGT TGAAG TTATTAACTA TCCAATTCTA CGTAG GAAAT 
ATTTTAATCC AAAAGGG AT T TTTGACTACT T CATATC ATA TCATCACTAT TCTAAACAGA 
TTGCTCAATA T GCCATAG AA AATAAGGTTG ACAT AAT TCA CAATAATACT 
ACCGCTGTCT TAGAAGGCAT TTATCTGAAG CGAAAACTCA AATTACCTTT GTTGTGGCAT 
GTTCATGAGA TTATTGTCAA ACCTAAATTC ATCTCTGATT CGATCAATTT 
TTTAATGGGG CGTTTTGCTG AT AAGAT TGT GACAGTTTCA CAGGCTGTGG CAAACCATAT 
AAAACAATCA CCTCATATCA AAGATGACCA AATCAGTGT A" ATCTACAATG 
GGGTAGATAA TAAAGTGTTT TATCAGTCCG ATGCTCGGTC TGTTCGAGAA AGATTTGACA 
TTGACGAAGA GGCTCTTGTC ATTGGTATGG TCGGTCGAGT CAATGCGTGG 
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AAAGGACAAG GAGATTTTTT AGAAGCAGTT GCTCCTATAC TCGAACAGAA TCCAAAAGCT 
ATCGCCTTTA TAGCAGGAAG TGCTTTTGAA GGAGAAGAGT GGCGAGTAGT 
AGAATTAGAA AAGAAGATTT CTCAATTAAA GGTCTCTTCT CAAGTCAGAC GAATGGATTA 
TTATGCAAAT ACCACTGAAT TATATAATAT GTTTGATATT TTTGTACTTC 
CAAGTACTAA TCCAGACCCT CTACCAACGG TTGTACTAAA AGCAATGGCA TGCGGTAAAC 
CTGTTGTCGG TTACCGACAT GGTGGTGTTT GTGAGATGGT GAAAGAAGGT 
GTTAACGGTT TCTTAGTCAC TCCGAACTCA CCGTTAAATT TATCAAAAGT AATTCTTCAG 
TTATCGGAAA ATATAAATCT CAGAAAAAAA ATTGGTAATA ATTCTATAGA 
ACGTCAAAAA GAACATT TT T CGTTAAAAAG CTATGTAAAA AATTTTTCGA AAGTCTACAC 
CTCCCTCAAA GTATACTGAT TGGCTGAAGT GAATGCTTTA G T AT AGCG AT 
TTATCGTATT CTCATTCGAT AAAACAAATG TTCAGAAACA GTTATAAGTT ATTTCTAAAG 
GGCACCTCTA TAAACTCCCA AAATTGCGAA TTTGGAGTTA CGAAAGCCTT 
GTTAAATCAA CATTTTAAAT TTTAGAAAAT TAGTTTTTAG AGCTCCCCTA AAATAGAAGA 
TAACAGAAGG GAGCCTTCAA AAACTTCATT TTTAATTGGA TTGTAGAAAA 
ACTGTTAAAT CAATATTTAG ATTTTTAGGA GTTCAGTTTT TGGGGGGAGA GCTTAATAAT 
CTATGCACTA TATTTCGAAA AATATATGGT GTAAAATCAG AACTGATGGT 
CGTGGCAAAA AAGAGAATGA GGAATTTATG AAAATTATTT CTTTTACAAT GGTTAATAAC 
GAAAGTGAGA TAATAGAGTC ATTTATACGG T ATAAT TATA ACTTTATTGA 
CGAGATGGTC ATTATTGATA ATGGTTGTAC AGATAACACG ATGCAAATTA TTTTTAATTT 
G AT TAAAG AG GGATATAAAA TATCCGTATA TGATGAGTCT TTAGAGGCAT 
ATAATCAGTA TCGACTTGAT AATAAATATC TAACGAAAAT AATTGCTGAA AAAAATCCAG 
ATTTGATAAT ACCTTTGGAT GCGGATGAAT TTTTAACAGC CGATTCAAAT 
CCACG GAAAC TTTTGGAACA ACTGGACTTA GAAAAGATAC ATTATGTGAA TTGGCAATGG 
TTTGTTATGA CTAAAAAAGA TGATATTAAT GATTCGTTTA TACCACGTAG 
AATGCAATAT TGTTTTGAAA AACCTGTTTG GCATCATTCT GATGGTAAAC CAGTTACTAA 
ATGTATAATT TCCGCTAAGT ATTACAAAAA AATGAATTTA AAGCTATCGA 
TGGGACATCA CACTGTTTTT GGTAACCCAA ATGTAAGGAT AGAACATCAT AATGATTTGA 
AATTTGCACA TTATCGAGCT ATTAGCCAAG AGCAAT TAAT TTATAAAACA 
ATTTGTTACA CTATTCGCGA TATTGCTACT ATGGAGAACA ATATCGAAAC AGCTCAAAGA 
ACAAATCAGA TGGCGCTCAT TGAATCTGGC GTGGATATGT GGGAAACGGC 
GAGAGAAGCC TCTTATTCAG GTTATGATTG TAATGT T ATA CATGCACCAA TTGATTTAAG 
TTTTTGTAAA GAAAATATTG TAATAAAATA TAACGAACTA TCCAGAGAAA 
CAGTAGCAGA ACGCGTGATG AAAACGGGAA GAGAAATGGC TGTTCGTGCA TATAATGTGG 
AGCGAAAACA AAAAGAAAAG AAATTTCTAA AACC TATTAT ATTTGTATTA 
GATGGGTTAA AAGGAGATGA GTATATTCAT CCCAATCCAT CAAATCATTT GACGATCTTA 
ACTGAAATGT ATAACGTCAG AGGCTTACTT ACCGATAATC ACCAAATTAA 
ATT TCTC AAA GTTAATTATA GATTAATTAT AACTCCAGAT TTTGCTAAGT TTTTACCGCA 
TGAATTTATT GTTGTACCAG ATACCTTGGA TATAGAGCAA GTTAAAAGCC 
AGTATGTTGG TACAGGTGTA GACTTGTCAA AG ATT ATT TC T T T AAAAG AG TATCGAAAAG 
AGATAGGCTT TATTGGTAAT TTGTATGCGC TTTTAGGATT TGTTCCGAAT 
ATGCTCAATA GAATTTATCT AT ATAT TCAG AGAAACGGTA TT GCAAAC AC TATTAT AAAA 
ATCAAGTCGA G ATT GT GAGA GTTGTTTACT TTTATTTGTA AT TTT AAAAG 
TAATGCAGGC AGATAGGAGA AAAACGTTTG GAAAAATGAG AATAAGAATT AATAATTTGT 
TTTTTGTTGC CATAGCGTTT ATGGGCATAA TTATTAGTAA TTCGCAAGTT 
GTTCTAGCGA TAGGCAAAGC TTCTGTGATT CAGTATCTAT CTTATTTAGT TTTGATTTTA 
TGTATAGTTA ATGATTTATT AAAAAATAAC AAACATATTG TAGTTTATAA 
ATTAGGGTAT TTGTTTCTTA TTATATTTTT ATTTACTATC GGAATATGTC AGCAAATTCT 
TCCTATAACA ACTAAAATAT AT TTATCAAT TTCAATGATG ATT ATT TCAG 
TTTTAGCAAC GTTGCCAATA AG T T TGATAA AAGATATTGA TGATTTTAGA CGGATTTCAA 
ATCATTTGTT ATTCGCTCTT TTTATAACTT CGATATTAGG AATAAAGATG 
GGGGCAACGA TGTTCACGGG GGCAGTAGAA GGTATCGGTT TTAGTCAGGG TTTTAATGGA 
GGATTGACGC ATAAGAACTT TTTTGGAATA ACTATTTTAA TGGGGTTCGT 
ATTAACTTAC TTGGCGTATA AGTATGGTTC C TAT AAAAG A ACGGATCGTT TTATTTTAGG 
ATTAGAATTG TTTTTGATTC T TAT TTC AAA CACACGCTCA GTTTATTTAA 
TACTATTGCT TTTTCTATTT CTTGTTAATC TTGACAAAAT CAAAATAGAA CAAAGACAAT 
GGAGTACGCT TAAATATATT TCCATGCTAT TTTGTGCTAT TTTTTTATAC 
TATTTCTTTG GTTTTTTAAT AACACATAGT GATTCTTACG CTCATCGCGT TAATGGTCTT 
ATTAATTTTT T TGAGT AT T A TAGAAATGAT TGGTTCCATC TAATGT TTGG 
TGCAGCGGAT TTGGCATATG GGGATTTAAC TTTAGACTAT GCTATAAGGG TTAGACGCGT 
TTTAGGTTGG AATGGAACGC TTGAAATGCC CTTACTGAGT ATTATGTTAA 
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AAAATGGTTT TATCGGTCTG 
TAAGAATATT AAAAACAGAT 
ATCATTGTAG TCCTATCTGC 
ATGCCAATAT GTTTTTGTTT 
TATTAACAAA CAACTGCAAA 
TGGTAGAGCA TATGTTCTAT 
ATGATGATTT TTATGATAGC 
AGTCAGCATT ATTGTACCTA 
GTTTAGATAG CATTATTTCC 
GTTCTTCAGA TTCATCAACG 
GGTAGAATAA AACTTTTCCG 
AT C AAAAAT A GCACAGCAAA 
TGTTGACGGC AACATTGTTG 
GTCGGGAGGG TTACTTGCTA 
TGCAAAAGTG TCAAATTGAT 
ATTTTCCCAA TCATTATATG 
CTTTATAAGA ATATATATAT 
TTATTATTTA ATCTAAATTA 
TAACAGAAAT CTTTATTTTG 
TGATGTTTTT ATTCAATTAG 
TTGTTAAAAT ATTTGGTGGA 
AT AT TAT T T A TTATAGCTTA 
CCAAAGAAAT TGCATATATT 
AT TAAAC GAA CGTCCTCTGT 
TAATAATTTG TTTAAAATTT 
ACATTTCTAT CATCGTCCCA 
TGTATAAATA GCATTGTAAA 
GGTAGTACGG ATAATTCGGA 
TAGTCGCATT CGTTATTTTA 
CATAAGTCGC GCCAAGGGTG 
TTATTCATTC GGAGTTCATC 
TGGCAGTTGC TGGTTATGAT 
GCAGAGCCGC TTCCTACAAA 
CTAGAGGCGG ATGGTCATCG 
AAAAGAACTA TTTGAAGATT 
CACTTATCGC TTGCTCTATG 
GCTTGTACTA TTATGTTGAC 
GCTTCCATTG CCTACTGGAA 
AGTAGAGGAG ATAAAGAGCT 
TTGTTTTTAG GCAAATATAA 
TCTCCAAACG CTATTTAGAA 
ACTAATGAAT GCTTATTATT 
TCTTTCTGAA AACGGGGAAA 
CTCGGTAAGA ATGTTGTAAT 
AATCCAACAA ATAGTAGAAT 
GTTATTTTTA CACATCTGGA 
CTTTCCGTAT TTCGTTGACA 
AAC G AATAAG TGGAATACAA 
GTGGTATAAA AGAAAGTATA 
TTTTATTGAG CTATTCGAGA 
GTTCTTCATC GCTCCGTTCA 
GTTGGGAGTT ACTATGTTCC 
AATTTTGTAT GTTCTTTTCG 
AAACTTTTCA GTGGATGCCA 
ATATATAGAC TAATATCACT 
AGCAGGATGT GCGTTCCAAG 
ATTATTGGAG CAATTCTGAT 
GGTTGGAAGT CTACTTCCTT 
ATTTTTTATG ATAAAGTATG 
TGCTTCTTAT CATATCTACT 

DNA Serotype 2 



8/61 

GTAGGGTATG GGATTGTTTT 
AATATAAAAA CAATAGGAAA 
AACAGTAGAA AATTATATTG 
ATTAAATTCT ATATCTACTA 
CATAAATTGG CAGGAATAGA 
AGGTGGCAAG ATAAAGATAG 
AAAGCAAGTT ACGGCATAAA 
TTTTTAATAC GGAAAAGTAC 
CAATCGTATA CTAATCTAGA 
GAT AT AT GT T TGGAATACGC 
GTTACCAAAT GGTGGTGTTT 
TTATATTATG TTTGTAGATT 
AGTCCTTATA CACCTGTTTA 
CTTTTGATGG AAATTATCAA 
TTGGAAGAGA TAAAAGAGGT 
AGCGGTATCT TTAATAGCCC 
AAACCAAGGT TTTGACACTG 
TTTAAAGAAT ATAAAAAAAG 
CCAGAAGAAG TTTACAAAGT 
AAAATTTAGA AGAAAAAACT 
CAATATGAAT TTTCTGTTTT 
TTAATGTTCA AAAATGGAGA 
TAAGTATTTA TACAATAGGC 
TTTTAAAAGA ATATGTAAAT 
TTTTAAATAC TTTAATTAGG 
ATTTACAATG TTGAACAATA 
TCAGACCTAC AAACATATAG 
AGAAATTTGT TTAG CATATG 
AAAAAGAGAA CGGCGGGCTA 
ACTACTTAGC TTTTATAGAC 
CAACGTTTAC ACGAAGCAAT 
AGGGTAGATG CTTCGGGGCA 
TCAGGCTGTT CTGAGCGGCA 
CTTTGTGGTG GCCTGGAATA 
TTCGATTTGA AAAGGGTAAG 
AGTTAGAAAA AGTTGCAATA 
C G AG AAAAT A GTATCATAAC 
TTTCAAAATG AACGAATGGA 
CTTACTAGAG TGTTATCGTT 
TCATTGGTTG AGCAAACAGC 
TTGTATATAA AC AAT TGAAG 
TGGTAGGGTG TCTTCATCTT 
GAT AAAATT C AAGAAAGATT 
AAATGGTTGA AAGAAAAGGG 
AGCACTCTTT GATACGATTA 
TTGGTCTGTT GAGCAGCGTC 
TGGCTGTTCC AATTTTTCTG 
AACAAGAGAC GCTAAAGCTC 
AACATGCTTT GTCTCTATGC 
ACCATCTGAT AGGAGTAAAG 
TTTGTCCTGT GGCTACTTTC 
GTTGTTGATT CAGGTAGTTT 
AGAAAAATAA ATGGTTGGGC 
TATTTGCTAA CATGGCTGAA 
TCGTTATCTT TTTGTTCTAG 
GTAGATACTT TCATTGCGAC 
TTTTGTGAAT CATTCTATAG 
TCTATGCGTC CCATTTGCGT 
GACAGAAGAT TCCAGCAATA 
TG AC CCAGAT GCTGTATTTT 

FIG. 3E 



ATATAAACTT TATCGTAATG 
GTCTGTATTT 

TAAATTTAAG TTTTGTATTT 
TGGAATCAAC 

GTTTTGAGTT GCTATTAATT 
TATTTTTTAC 

AG GAATT AGA GGATGGAAAA 
TTAAGAGAGT 

GATTCTTTTG ATAGATGACG 
AGAGCAAGAT 

CAAACGCAAG GAATTACGGT 
CTGATGATAT 

AAAGAGAATG ATAGTGATTT 
GAATCTGAGC 

GCGAGACTTA GGAAATGAAA 
TTGTTGCAAA 

AACAGTGGTT AG G AG AGGAC 
TCCGCTATGT 

ACTACAAATA CGTTTAAATA 
TTTGATTTGT 

TAAAGAGACG CTACAGTGGC 
TGAATCGCTT 

ATTCTTTAGA TACTCTAAGT 
TAATTGTTGC 

GAAGAAAAAA ATAATGATTA 
TCTATCCAAG 

AGATTCTTCT GGTGAATGAC 
CGAAGAAAGA 

TCAGATGCCC GTAATTATGG 
TCAGATGATT 

T GAGAG AG AG AATGCCCTTG 
TTTCTTAACA 

GGAATGTTTG TAAAAAGCTG 
AACTCTATAA 

ATTCATGAAG ATGAATACTT 
GTTAAGGAGT 

TTCTAGTATG AC T GACCATC 
CTTCTATGAA 

CATTTTTAGC CTTTGCTGTT 
AAAAGAAGCT 

CAAAATAAGC GACTTGCTTT 
AATTTTAGTG 

GAGAAGAAGT GAAAGTAGTA 
GATTAAAATG 

AATGTAT CAT GGTACTTTGT 
AATGGTTTAT 

TTGCTTTCTG CCTATTTTCG 
AAGTTCAGCA 

TATCGTGATG GCTGTTAATG 
CCTTTTTCAG 

TGGAGAATCG GGTCCAGGGA 
TTTTATTACC 

TTGCTTACTT GTTTTTTAGT 
CACGGCATAT 

GGCTTGGTTT TTTCTTTCAA 
CCTATTTGGG 

AGCCCTTCTC CTGGTTTTAT 
ATGCTATGCT 

CTGTTGTCAA AATTGGGAGT 
TCAGTAGTCG 
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CACCATTTTT AGCAGTGCAA TTTAAGGTAT CTTCGTTGAA TTTGTGGAAC GGCTTGTTTA 
CCTTTCTAAT TTGCCTGTTT GGTGGCTATA TTTTCTACAA AGTGGATCTG 
TTTATGAGAG TACGTGGAAA ACGATAATGA CTCATTTCAG AT T AGCAGAT GCCATTTCGT 
TTATTAGCAG ATTCGCATGT TAATATTCCG ACAAAGAAAT TCAAATAGGT 
TGACGAGAGA GGAGTGGTAT CTGTTTCTAA ACCCCAGTAT CCCCCTTTAT TTTCAAAGCT 
ATATTTATTA ACTGAACAAG GAGAATTTTT AAGAGAACTG TTTGTTTAAT 
CCCAGCACGA TCTGGTTCGA AAGGCTTACC GAATAAAAAC ATGCTATTTT TGGACGGGAA 
ACCCATGATT TTTCACACGA TTGATGTGGC AATTGAATCA GGTTGTTTTG 
AGAAAGAAGA CATCTATGTC AGTACGGATT CAGAAAT GT A TAAGGGGGGC ACCTCTATAA 
AT T CCCAAAA TTGCGAATTT GGAGTTACGA AAGCCTTGTT AAATCAACAT 
CTTAAATTTT AGAAAATTAG TTTTTAGAGG TCCCCAAGGG GATTTGCGAG ACAAGAGGCA 
TCAATGTATT GTTAAGACCC AAAGAACTAT CTACTTATCA TACTCCATCG 
AATGAAGTCA GTACGCACTT TTTTACGAAT CTGGATTTTA TGAAGATTGT ATATTTGTTC 
TTCTGCAAGT CACCTCACCG TTACGGACTG GCGAACAGAT AAAAGAAGCC 
ATGAATATGT ACTTACAGGG GGACTCAGAA AATGTTTTGC ATTTCAATGA TGAAGGGCAA 
GAAAGAGTGA AT C AGT AC AT TATCGAAGCT GTACAGGGGT TATAAAAAGG 
GGTTACTTAT CCTTAAAGTC TGTATGTAGA AGGAGAAAAA TTGAGACGAA TTTATATTTG 
CCATACGATG TATCAGATCC TGATTTCCTT GTTAAAGATG GACGTTGAGA 
GAGATAGTTT GATGTCCGTT GATATCATCG GGCATTTTCC AGATGTCAGG GAGCAACTGC 
AGCAGCATGT TCATCTAATC GAGGGAGACG GAGCGTTCAT TTGATCTATA 
TTCTTTGATA GCTAGATCAA AAACAAAAGA ACGCCTTTCC TTGTTACAGA GCTATGACGA 
GGTGATCATT TTTCAAGATC ACCGTCAAGT CGGTCATTTT TTAAATAAAC 
ATCGGATTCC CTATTCTCTT TTGGAGGATG GTTATAATTT TTTCAAGGAT AAAAGAGTGT 
GCGATTTGGA GTCAATTCAA TCATCTGTCT GGAAAAGACT CTTTTATCAA 
TGGTATTTTA AACCAACATA TTTGATTGGT TCAAGTCTCT ATTGTCAATC CATTGAGGTC 
AATGATCTGT CGCTCGTACA ATTTGACTAG GCTTATAAAC CCTTTGTAGA 
AGTTCCGAGA AAGCAATTAT TTGATCAAGC ATCGCCAGAG AAGGTGCAAG CGCTGCTGCA 
GATATTTGGA GCAAGGGCGA TAGTAGCGGA TGAAGAGTCT TCTCAAAAAC 
GATTGCTATT ATTGACCCAG CCCTTGTCTT GGGATTATCA TGTGACCGAA GAGAGTTGTT 
GGAGATTTAT GTAGCAGGTC TTGCCCCTTA TCGGGAAGAC TATACAATCT 
ACATAAAACC GCACCCACGA GATGGGGTTG ATTATTCATT TCTGGGTAAG GCTGTGGTGC 
TTCTGCCTCA AGGTATTCCG TTTGAGTTGT TCGAAATGGC AGGTAATATC 
CGTTTTGATA TCGGTATGAC CTATAGTTCG TCTGCTTTAG ATTTTTTAAA TTGTTTTGAA 
GAGAAAGTGT AT TT AAAGG A CACTTTTCCT CTTCTTTCAA AAAAT GAT AT 
TTTGCGTGAG GGGATAGAAT AGGAGGATTC ATGTCTAAAA AATCAATAGT TGTCTCAGGT 
CTCGTCTATA C GAT T GGAAC CATCCTCGTT CAGGGATTAG CCTTCATTAC 
CGTCCCCATC TATACTCGTG TCATTTCTCA GGAAGTATAT GGGCAGTTTA GCTTGTATAA 
TTCGTGGGTG GGGCTAGTTG GTCTCTTTAT CGGTCTACAG TTAGGTGGGG 
CTTTTGGCCC GGGATGGGTA CACTTCCGCG AGAAATTTGA TGATTTCGTA TCCACCTTGA 
TGGTCTCTTC TATCGCTTTC TTTTTACCAA TTTTTGGGCT ATCTTTTCTC 
CTCAGTCAGC CCCTATCGCT CCTATTTGGT TTGCCTGATT GGGTCGTTCC GCTTTACTTT 
TTGCAAAGTT TTATGAGTGT TGTGCAAGGA TTTTTTACGA CCTATTTAGT 
GCAGCGGCAG CAGTCCATGT GGACTTTACT CCTATCGGTA CTGAGCGCTG TTATCAACAC 
TGCTTTATCT TTATTTCTCA TCTTTTCGAT GGAGAATGAT TTCATCGCTC 
GTGTAATGGC AAACTCGGCA ACGACTGGTG TTTTTGCTTG TGTGTCCTTG TTGTTTTTCT 
ATAAGAAGAT TGGGCTTCAT TTTCGAAAGG ACTATCTTCG GTATGGTTTA 
AGTATATCGA TTCCTCTTAT TTTTCATGGA TTAGGTCATA ATGTACTCAA TCAATTTGAC 
AGAATCATGC TCGGCAAGAT GCTAACACTG TCAGATGTAG CCCTATACAG 
TTTCGGCTAC ACACTTGCGT CTATCTTACA AATTGTGTTT TCGAGCTTGA ATACGGTATG 
GTGTCCGTGG TATTTTGAGA AAAAGAGAGG TGCAGATAAA GATTTGCTCA 
GTTATGTCCG TTACTATCTG GCGATTGGCC TGTTTGTGAC TTTTGGATTT CTAACAATTT 
ACCCTGAATT AGCGATGTTG TTAGGTGGAT CTGAGTATCG TTTCAGTATG 
GGATTTATTC CCATGATTAT TGTCGGGGTG TTCTTTGTAT TTCTTTATAG TTTTCCAGCC 
AATATCCAGT TTTATAGTGG AAATACAAAG TTTTTGCCAA TTGGTACTTT 
TATAGCAGGT GTACTAAATA TTTCCGTCCA CTTTGTTTTG ATACCGACAA AGAAT TTATG 
GTGCTGCTTT GCAACGACTG CTTCCTATCT GTTGTTGCTA GTCTTGCATT 
ATTTTGTTGC TAAGAAAAAG TATGCTTACG ATGAAGTTGC GATTTCAACA TTTGTTAAGG 
TAATTGCTCT TGTTGTCGTC TATACAGGCT TGATGACAGT ATTTGTCGGT 
TCAATCTGGA TTCGTTGGTC AC TAGGAATA GCGGTTCTAG TCGTTTATGC CTACATTTTT 
AGAAAGGAAT TAACAGTTGC CCTCAATACA TTCAGGGAAA AACGGTCTAA 
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ATAAGGGCAC CTCTATAAAC TCCCAAAATT 
ATCAAACATT TTAAATTTTA GAAAATTAGT 
AAACGTCCCA AATGAGAGGT GCTCATAAGA 
CAAGTATTCT CTACCATGAA AATTGTGCTA 
AATGTTTCTT AAAGGACGTA TGCGCCTCTG 
CTAAAAATTG GGTAGAAAAG CAGATTAAAC 
TCGTGTTGAA GAGCAGGCTT TAGAAGCAAC 
GGCTATTTTT TCTAATCGGC TATCAGAAGT 
TGTTCTTTTA CTACTTAAGG AAAACCAAGC 
ACAGTCATTT TTAGAAAGGA AATAAAATGG 
GGTTGTAATC ACAACGGTGA TGTTCATCTA 
TGTGGTGTGG ATGCCGTTAA ATTTCAGACA 
TTCAAAATAC GCACCAAAGG CCGAATACCA 
GCTCGAAATG ACTCGTCGTT TGGAATTGAG 
TGCGTGATTA CTGTCTTGAA AAGGGAGTTG 
CATTGGACTT CTTGATTAGC ACAGATATGC 
GGTGAGATTA CCAATCTTCC CTATTTGGAA 
CTTTCAACTG GTATGGCTGT TATGGATGAA 
TTTGCAGGAA AATGGAACGA CCGATATTTC 
CCCTTACCCT GCTTTGAATT TGAATGTCTT 
TTCCAAACTT AACAATTGGC TATTCAGACC 
CTGCAGCAAT GGGAGCTGAA TTGATTGAAA 
GAAATGGAAG GACC AG AT C A TAAAGCGAGT 
AAAGGAGTGA GGATAGTGGA ACAATCTCTT 
AGAAGAAGTT GAAGTACGAA ATAAAATTGT 
AATTGCTAAA GGCGAAGTCT TTACAGAAGA 
CAGGAAATGG AATTTCGCCA ATGGAATGGT 
ATTTTGAGGA AGACCAAAAT ATTTGCCATA 
TAAGCGGAGT AAGGATGAAA AAAATTTGTT 
TTATGCGTCG CTTATTGAGC TATCTACAGG 
GATCTTGTAG T G AC AGCCAT GCATCTAGAA 
GAAGC GGACA AGCG TAGG AT TGTCAAGCGG 
TACGTCTAAG CAGACAATCG TCAAATCTTT 
TTTTGAAGAA GTCCAGTATG ACTTGGTGTT 
AGATGCTACC AGTTGCCAAT GCTGCGTTGC 
GTGGTGAAAA AACCATGGGA AATTTTGATG 
ACCAAGATGA GTCACCTTCA TCTGACATCA 
CTAGGAGAAA ATCCAACCAT GTACTGAACA 
AATGTTTTAA AACAAGACTT TTTGACAAGA 
TTTGCCGAGG ATT ACT AT GT TGTACTCTTT 
TAACACAGCC GAAGAACAAA CGCAGGCCTT 
GTGTTTGATA ATTGGATCCA ATTCGGATAC 
AATTGATGCA TGAATTTGTA AAAC AAG AC T 
CTCGTTATTA CCATTCCTTG GTCAAGCATT 
TCTTCGTCAG GTTTGATTGA AGTGCCCTCA 
CGCCAATTTG GACGTTTGTC AGGACCGAGT 
TAAGGAAGCG ATTGTTGGTG GTTTGGGGCA 
AT TTGAACAA CCTGATTCTG CTTTACAAGG 
TTTTATCTGT ACAGGCCTCA ACCATGAAAG 
ATGAAAAAAG TAGCCTTTCT AGGAGCGGGT 
TCCTTGGTTG GATAGAACTC GATATGAACT 
TGACTATCGT GGCTATCCTG TATTTGGTCC 
ATTTGGATGA TGGAAAAGTA GATGCTGTCT 
AGGAAATCTT TGACTTGCTT GCCAAAGATC 
ATCATTAGCG AGCAAGCCAA TATTTTTTCC 
ATAGGTTTTT CAAGTTTTGT AGGAGCCGAT 
TATCATCAAT ACGGGTGCCA TTGTGGAACA 
TACTCCAGGA GT G ACCATAA ATGGCTTGTG 
AT AT TGGAAG TGGTTCAACA GTGATTCAAT 
GGGCAGGGAC AGTTGTTTTG AAATCGTTGA 



GCGAATTTGG AGTTACGAAA GCCTTGTTAA 
TTTTAGAGGT CCCCATATAA 
ATTGACCATC ACTGCCATCT ACCCAAAGTT 
TAATCAAGTA TAAAGAAGGG 
CTTATGCCAG AAGTCATGAG GTAAATCTCC 
TTCCACCAAT CTATTGAAGA 
AAGCCCTGAG ACTATTCGAA AGAAATCTAG 
GAAGTAGCGA TCTTTATTAG 
TGCTCCCTCA AGACTT TATG GGAGCGATTT 
TTTATATTAT TGCAGAAATT 
GCACGGAAAA TGGTAGAAGT TGCCGTTGAT 
TTTAAGGCAG ATTTGTTGAT 
AAAAAT T AC A ACAGGAGAGT CAGAT TCTGA 
CTTTGAAGAG TATCTTGATT 
ATGTGTTTTC GACACCTTTT GATGAGGAAT 
CCGTTTATAA GATTCCATCT 
AAAATTGGTC GTCAAGCTAA GAAAGTTATT 
ATTCATCAAG CGGTGAAGAT 
GATTTTGCAT TGTACAACCG AGTATCCAAC 
GCATACCTTG AAAAAAGAAT 
ATAGTGTTGG TTCAGAAGTA CCCATCGCTG 
AGCACTTTAC TCTGGACAAT 
GCTACTCCTG ATATCTTAGC AGCCTTGGTA 
GGTAAATTTG AAAAAGAGCC 
AGCTAGAAAA TCTATTGTTG CCAAAAAAGC 
AAACATCACT GTCAAAAGAC 
ACAAAGTCTT GGGGCAGGTG AGTGAGCAGG 
GTGCTTTTGA AAATCAAftTG 
TTGTGACAGG CTCTCGTGCC GAATATGGGA 
ATGATCCAGA AATGGAGCTG 
GAAAAATATG GGATGACGGT CAAAGACATC 
ATTCCATTGC ATTTGACGGA 
AGCGACCTTG ACAGAGCAAC TCACGGTTCT 
GATTCTGGGG GAT C GC TAT G 
TTTATAATAT TCCTATTTGC CATATTCATG 
AGTCGATTCG CCATGCCATT 
ACGGATGAAT TTAGAAATCG TGTCATTCAA 
TCGGAGCTAT GGGTGTTGAA 
GAAGAGTTGG CGATGGAACT TGGAATTGAT 
CACCCTGTTA CCTTGGAGGA 
ATTAGATGCT CTAAAAGAAG ATGGTAGCCA 
ACATGCCGAT AAGATAATGG 
CTGATTCTTA CATCTTTACT TCGCTTCCAA 
CACAAGGTTT AATAGGGAAT 
TTACAGGTTC CGACCTTAAA TATTGGAAAT 
GTGGTACATG TTGGAACTTC 
ATTACGTGAT GTGATAGATT TTACCAATCC 
TTATCGAGCT ATCAAGGAAT 
AGTTTTATGA TAGATAGGGG AGAAAGTTTG 
ACCTTTTCAG ATGGTGTCCT 
CATTGGATAT TTTGAAGATA AACCGATCAG 
CTTGCAAGAT GTCCTAACCT 
TCGTCACTAT AGGTGACAAT GTCAAGCGCA 
ATTATGATGC TTTGTTCAAC 
CCAGATAGTA TCAAGGGACG AGGGGTTTTC 
TCCTATGTCT ATGACAATTG 
TCATACCACG GTGGAGGCCC ATTGTAACAT 
CCGTATCGGA GAAAGCACTT 
GTATCGAGAT TGCACCTTAT ACAACATTGG 
CGGAGTCAGG GACCTATGTT 
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GGTGTACCTG CTAGAAAGAT TAAATAGGTG AATTGATGGA ACCAATTTGT CTGATTCCTG 
CTCGGTCAGG ATCAAAAGGT TTACCAAATA AAAACATGTT ATTTTTAGAT 
GGTGTACCGA TGATTTTCCA TACCATTCGA GCTGCGATTG AGTCTGGATG TTTTAAGAAA 
GAAAATATAT ATGTCAGTAC TGATTCAGAG GTTTACAAGG AAATTTGTGA 
AACAACTGGG GTTCAAGTCC TCATGCGTCC AGCTGACTTG GCGACAGATT TTACAACCTC 
TTTTCAACTG AACGAACATT TTTTACAAGA TTTTTCTGAT G AC CAAGT AT 
TTGTTCTCCT GCAAGTTACG TCCCCATTAA GATCGGGAAA ACATG TCAAG GAGGCGATGG 
AGTTATATGG GAAAGGTCAA GCTGACCACG TTGTTAGCTT TACCAAAGTC 
GATAAGTCTC CAACATTGTT TTCAACTTTA GACGAAAACG GATTCGCTAA GGATATTGCA 
GGATTAGGTG GCAGTTATCG TCGTCAAGAT GAGAAAACAC TCTACTATCC 
TAATGGAGCG ATTTATATTT CTTCTAAGCA GGCTTATTTA GCGGATAAAA CTTATTTTTC 
TGAAAAAACA GCGGCCTATG TGATGACGAA GGAAGATTCG ATTGATGTAG 
ATGATCACTT TGATTTTACT GGTGTTATTG GTCGAATTTA CTTTGATTAC CAGCGTCGTG 
AGCAACAAAA CAAACCATTT TATAAAAGAG AGTTAAAGCG TTTATGTGAG 
CAACGAGTCC ATGATAGTCT TGTGATTGGC GATAGTCGTC TGTTAGCCTT GTTACTGGAT 
GGTTTCGATA ATATCAGCAT CGGTGGGATG ACAGCTTCGA CAGCACTTGA 
AAACCAAGGT CTCTTTTTGG CTACTCCGAT AAAGAAAGTT TTGCTTTCTC TTGGTGTGAA 
TGATTTGATT ACTGACTATC CCTTGCATAT GATTGAGGAT ACTATTCGCC 
AGCTGATGGA AAGTCTTGTT TCCAAAGCAG AGCAGGTTTT TGTGACGACG ATTGCCTACA 
CGCTGTTTCG TGATAGCGTT TCCAATGAAG AAATTGTGCA GCTGAATGAC 
GTTATTGTTC AGTCAGCAAG TGAACTGGGT ATTTCAGTGA TTGATCTAAA TGAAGTTGTT 
GAAAAAGAGG CGATGCTTGA CTATCAGTAT ACCAATGATG GATTGCATTT 
CAATCAGATT GGACAAGAGC GTGTGAATCA GCTGATTTTG ACAAGTTTGA CAAGATAATT 
TGGTGATAGA AGCTATTTCA GTGGCTAGAC TATGTTGGTA TGTGTTTTAG 
AGCCCAGGAA TAACATCTGT AGAGGATGCT AGCCTTGAGA ATT GACAACC ATTTAGTTGT 
TTTAATTATA TAAGGGGACC TCTAAAAACT CCCTAAATTT CCCAAAAATG 
AGATAATAGA ATAAAAAGTA AT G AGG AG AG CTGTCATGCA TTTATTCACA GACGATGAAA 
AAATCTTGTC AAAACTATCA GAGAAAGGCA ATCCCTTAGA ACGTTTGGAT 
GCCGTTATGG ATTGGAATAT CTTTCTTCCA TTGTTGTCAG AG T TAT T C AG TCGTAAAGAT 
AAAGTCATCA GTCGTGGCGG TCGTCCTCAC CTAGACTATC TCATGATGTT 
CAAAGCGCTC TTGCTTCAAC GTCTTCATAA CCTATCTGAC GATGCCATGG AAT AT CAACT 
GCTGGATCGT ATATCTTTTC GTCGTTTTGT TGGTTGTCAT GAAGACACTG 
TTCCCGATGC GAAAACTATC TGGCTCTATC GTGAGAAATT AACCAAGTCA GGTCGTGAAA 
AGGAGTTGTT CGATTTGTTC TATGCCCATC TCACAGATGA AGGGGTGATT 
GCCCATTCAG GTCAGATTGT GGATGCTACC TTTGTCG AAT GCCCTAAACA ACGCAATTCA 
CGTGAGGACA ATCAGAAAAT CAAAACT T AT CGAAAATTAT GAGGTCACAA 
CAGCTAGTGT ACACGACTCC AATGTCCTAG CTCCTCTTTG T GAT GC C AAT GAAGCGGTTT 
TTGATGACAG TGCTTATGTT GG AAAAT C AG TACCAGAAGG TTGTCGCCAC 
CACACGATTC GTCGTGCTTT TAGAAATAAA CCGTTGACTG AGACTGATAA GGTCATTAAT 
CGACATATTA CCAAAGTCCG TTGTCGCGTT GAGCATGGTT TTGGCTTCAT 
TGAAACTAAC ATGAAAGG T A ACATCTGTCG AGCAATTGGG AAGGCACGAG CTGAAACCAA 
TGT GACCTTA ACCAACCTGC TCTACAATAT CTGTCGTTTT GAGCAAATCA 
AACGACTGGG ATTACCATCC GTGGGCTTAG TGCGCCCAAA AAATAGGAAA ATAAGCAAAA 
AGAGGCTGGG CAAAAACTAG TTTCTCACAA TAAAAAAACG GCTCTTTGTC 
AACTGTAGTG GGTAGACGAA AAGCTAACAC CTAGAGAGGA CGAAATTCGT TCTCTCATTT 
TTGATGTTTA AAGCGTAACC GCCTAATAAC AAGGTATCTA TCCAATCACA 
CATTCCTCCA TTATATAGTT AAATGAAACA AAAAC AG T AC ATCTATGATA TAATGTATTT 
ATGGCATATT CAT T AGATT T TCGTAAAAAA GTTCTCGCAT ACTGTGAGAA 
AACCGGCAGT ATTACTGAAG CATCAGCTAT TTTCCAAGTT TCACGTAACA CTATCTATCA 
ATGGCTAAAA T TAAAAG AG A AAACCGGCGA GCTTCATCAC CAAGTTAAAG 
GAACCAAGCC AAGAAAAGTG GATAGAGATA AATTAAAGAA TTATCTTGAA ACTCAT CCAG 
ATGCTTATTT GACTGAAATA GCTTCTGAAT TTGACTGTCA TCCAACAGCT 
ATTCATTACC CCCTCAAAGC TATGGGATAT ACTCGAAAAA AAAGAGCTGT ACCTACTATG 
AACAAGACCC TGAAAAAGTA GAACTGTTCC TTAAAGAATT GAATAACTTA 
AGCCACTTGA CTCCTGTTTA TATTGACGAG ACAGGGTTTG AGACAT AT T T TCATCGAAAA 
TATGGTCGCT CTTTGAAAGG TCAGTTGATA AAAGGTAAGG TCTCTGGAAG 
AAG AT AC C AG CGGATATCTT TAGTAGCAGG TCTCATAAAT GGTGCGCTTA TAGCCCCGAT 
GACATACAAA G AT ACT AT G A CGAGTGGCTT TTTCGAAGCT T 
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SLDI DHMMEVMEASKSAAGSACPS PQAYQAAFEGAENI I WT ITGGLSGS FNAARVARDM 
YIEEHPNVNIHLIDSLSASGEMDLLVHQINRLISAGLDFPQWEAITHYREHSKLLFVLA 
KVDNLVKNGRLSKLVGTWGLLNIRMVGEASAEGKLELLQE<ARGHKKSVTAAFEEMKKAG 
YDGGRI VMAHRNNAKFFQQFSELVKAS FPTAVI DEVAT SGLCS FYAEEGGLLMGYEVKA 

0RF2Z 
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MKKYQVT I QDILTG IEEHRFKRGEKLPS IRQLREQYHCS KDTVQKAMLELKYQNKI YAVE 
KSGYYILEDRDFQDHTCRAQSYRLSRITYEDFRICLKESLIGRENYLFNYYHQQEGLAEL 
I S SVQSLLMDYHVYTKKDQLVI TAGSQQAL YI LTQMETLAGKTE I LI EN PT YSRMI ELIR 
HQGIPYQTIERNLDGIDLEELESIFQTGKIKFFYTIPRLHNPLGSTYDIATKTAIVKLAK 
QYDVYIIEDDYLADFDSSHSLPLHYLDTDNRVIYIKSFTPTLFPALRIGAI SLPNQLRDI 
FIKHKSLIDYDTNLIMQKALSLYIDNGMFARNTQHLHHIYHAQWNKIKDCLEKYALNIPY 
RIPKGSVTFQLSKGILSPSIQHMFGKCYYFSGQKADFLQIFFEQDFADKLEQFVRYLNE 



ORF2Y 
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MKI 1 1 PNAKEVNTNLENAS FTLLS DRSKPVLDAI SQFDVKKMAAFYKLNEAKAELEADRW 
YRIRTGQAKTYPAWQLYDGLMYRYMDRRGIDSKEENYLRDHVRVATALYGLIHPFEFISP 
HRLDFQGSLKIGNQSLKQYWRPYYDQEVGDDELILSLASSEFEQVFSPQIQKRLVKILFM 
EEKAGQLKVHSTISKKGRGRLLSWLAKNNIQELSDIQDFKVDGFEYCTSESTANQLTFXR 
SI KM 



0RF2X 
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MKKRSGRSKSSKFKLVNFALLGLYSI TLCLFLVTMYRYNI LDFRYLNYI VTLLLVGVAVL 
AGLLMWRKKARIFTALLLVFSLVITSVGIYGMQEWKFSTRLNSNSTFSEYEMSILVPAN 
SDITDVRQLTSILAPAEYDQDNITALLDDISKMESTQLATSPGTSYLTAYQSMLNGESQA 
MVFNGVFTNILENEDPGFSSKVKKIYSFKVTQTVETATKQVSGDSFNIYISGIDAYGPIS 
TVSRSDVNI IMTVNRATHKILLTTTPRDS YVAFADGGQNQ YDKLTHAGI YGVNAS VHTLE 
NFYGIDISNYVRLNFISFLQLIDLVGGIDVYNDQEFTSLHGNYHFPVGQVHLNSDQALGF 
VRERYSLTGGDKDRGKNQEKVIAALIKKMSTPENLKNYQAILSGLEGSIQTDLSLETIMS 
LVNTQLESGTQFTVESQALTGTGRSDLSSYAMPGSQLYMMEINQDSLEQSKAAIQSVLVE 
K 



CPS2A 
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MNNQEVNAI E I DVLFLLKT I WRKKFL I LLTAVLTAGLAFVY S S FLVTPQ YDS T T RI YWS 
QNVEAGAGI/TNQELQAGT YLAKD YRE I IL S QDVLTQVATELNLKE S LKE KI S VS I PVDTR 
IVSISVRDADPNEAARIANSLRTFAVQKWEVTKVSDVTTLEEAVPAEEPTTPNTKRNIL 
LGLLAGGILATGLVLVMEVLDDRVKRPQDIEEVMGLTLLGIVPDSKKLK 
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MAMLE I ARTKREGVNKTEE Y FNAI RTNI QLSGADI KWGI TS VKSNEGKST TAASLAI AY 
ARSGYKTVLVDADIRNSVMPGFFKPITKITGLTDYLAGTTDLSQGLCDTDIPNLTVIESG 
KVSPNPTALLQSKNFENLLATLRRYYDYVIVDCPPLGLVIDAAIIAQKCDAMVAWEAGN 
VKCSSLKKVKEQLEQTGTPFLGVILNKYDIATEKYSEYGNYGKKA 

CPS2C 

DNA Serotype 2 SEQ ID NO: 14 
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MIDIHSHIIFGVDDGPKTIEESLSLISEAYRQGVRYIVATSHRRKGMFETPEKIIMINFL 
QLKEAVAEVYPEIRLCYGAELYYSKDILSKLEKKKVPTLNGSCYILLEFSTDTPWKEIQE 
AVNEMTLLGLT PVLAHI ERYDALAFQSERVEKLI DKGC YTQVNSNHVLKPAL IGERAKEF 
KKRTRY FLEQDLVHC VAS DMHNL Y S RP P FMREAYQLVKKE Y GE DRAKALFKKNPLL I LKN 
QVQ 



CPS2D 
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MNIEIGYRQTKLALFDMIAVTISAILTSHIPNADLNRSGIFIIMMVHYFAFFISRMPVEF 
EYRGNLIEFEKTFNYSIIFVIFLMAVSFMLENNFALSRRGAVYFTLINFVLVYLFNVIIK 
QFKDSFLFSTTYQKKTILITTAELWENMQVLFESDILFQKNLVALVILGTEIDKINLPLP 
LYYSVEEAIGFSTREWDYVFXNLPSEYFDLKQLVSDFELLGIDVGVDINSFGFTVLKNK 
KIQMLGDHSIVTFSTNFYKPSHIWMKRLLDILGAWGLIISGIVSILLIPIIRRDGGPAI 
FAQKRVGQNGRIFTFYKFRSMFVDAEVRKKELMAQNQMQGGMFKMDNDPRITPIGHFIRK 
TSLDELPQFYNVLIGDMSLVGTRPPTVDEFEKYTPSQKRRLSFKPGITGLWQVSGRSDIT 
DFNE WRLDLT Y I DNWT IWS DI KI LLKTVKWLLREGGQ 



CPS2E 



DNA Serotype 2 SEQ ID NO:16 
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MRTVYI IGSKGIPAKYGGFETFVEKLTEYQKDKS INYFVACTRENSAKSDI TGEVFEHNG 
ATCFNIDVPNIGSAKAILYDIMALKKSIEIAKDRNDTSPIFYILACRIGPFIYLFKKQIE 
SIGGQLFVNPDGHEWLREKWSYPVRQYWKFSESLMLKYADLLICDSKNIEKYIHEDYRKY 
APETSYIAYGTDLDKSRLSPTDSWREWYKEKEISENDYYLWGRFVPENNYEVMIREFM 
KSYSRKDFVLITNVEHNSFYEKLKKETGFDKDKRIKFVGTVYNQELLKYIRENAFAYFHG 
HEVGGTNPSLLEALSSTKLNLLLDVGFNREVGEEGAKYWNKDNLHRVIDSCEQLSQEQIN 
DMDSLSTKQVKERFSWDFIVDEYEKLFKG 

CPS2F 



DNA Serotype 2 SEQ ID N0:17 
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MKKI LYLHAGAELYGADKVLLEL I KGLDKNEFEAH VI L> PN DG VLVPALREVGAQVEVINY 
PILRRKYFNPKGIFDYFISYHHYSKQIAQYAIENECVDIIHNNTTAVLEGIYLKRKLKLPL 
LWHVHEIIVKPKFISDSINFLMGRFADKIVTVSQAVANHIKQSPHIKDDQISVIYNGVDN 
KVFYQS DARS VRERFDI DEEALVI GMVGRVNAWKGQG DFLEAVAPI LEQNPKAI AFIAGS 
AFEGEEWRWELEKKISQLKVSSQVXRMDYYANTTELYNMFDIFVLPSTNPDPLPTWLK 
AMACGKPWG YRHGGVCEMVKEGVNGFLVT PNS PLNLSKVILQLSEN INLRKKI GNNS IE 
RQKEHFSLKSYVKNFSKVYTSLKVY 

CPS2G 



DNA Serotype 2 SEQ ID NO:18 



FIG. 3R 
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MKII S FTMVNNESEI IES FIRYN YNFI DEMVI IDNGCTDNTMQI I FNLIKEGYKISVYDE 
SLEAYNQYRLDNKYLTKI IAEKNPDLIIPLDADEFLTADSNPRKLLEQLDLEKIHYVNWQ 
WFVMTKKDDINDSFIPRRMQYCFEKPVWHHSDGKPVTKCIISAKYYKKMNLKLSMGHHTV 
FGNPNVRIEHHNDLKFAHYRAISQEQLIYKTICYTIRDIATMENNIETAQRTNQMALIES 
GVDMWETAREASYSGYDCNVIHAPIDLSFCKENIVIKYNELSRETVAERVMKTGREMAVR 
AYNVERKQKEKKFLKPI I FVLDGLKGDEYIHPNPSNHLTILTEMYNVRGLLTDNHQIKFL 
KVNYRLIITPDFAKFLPHEFIWPDTXDIEQVKSQYVGTGVDLSKIISLKEYRKEIGFIG 
NL YALLGFVPNMLNRI YLYIQRNG I ANT I IKIKSRL . 



CPS2H 



DNA Serotype 2 SEQ ID N0:19 
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MQADRRKTFGKMRI RI NNLFFVA I AFMG 1 1 1 SNS QWLAIGKAS V I QYLS YLVLI LC I VN 
DLLKNNKHIWYKLGYLFLIIFLFTIGICQQILPITTKIYLSISMMIISVLATLPISLIK 
DIDDFRRISNHLLFALFITSILGIKMGATMFTGAVEGIGFSQGFNGGLTHKNFFGITILM 
GFVLTYLAYKYGSYKRTDRFILGLELFLILISNTRSVYLILLLFLFLVNLDKIKIEQRQW 
STLKYISMLFCAIFLYYFFGFLITHSDSYAHRVNGLINFFEYYRNDWFHLMFGAADLAYG 
DLTLDYAIRVRRVLGWNGTLEMPLLSIMLKNGFIGLVGYGIVLYKLYRNVRILKTDNIKT 
IGKSVFIIWLSATVENYIVNLSFVFMPICFCLLNSISTMESTINKQLQT 



CPS 21 

DNA Serotype 2 SEQ ID NO:20 
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MEKVSIIVPIFNTEKYLRECLDSIISQSYTNLEILLIDDGSSDSSTDICLEYAEQDGRIK 
LFRLPNGGVSNARNYGIKNSTANYIMFVDSDDIVDGNIVESLYTCLKENDSDLSGGLIAT 
FDGNYQESELQKCQIDLEEIKEVRDLGNENFPNHYMSGIFNSPCCKLYKNIYINQGFDTE 
QWLGEDLLFNLNYLKNIKKVRYVNRNLYFARRSLQSTTNTFKYDVFIQLENLEEKTFDLF 
VKI FGGQYEFS VFKETLQWHI I Y YSLLMFKNGDES LPKKLHI FKYLYNRH S LDTLS IKRT 
SSVFKRICKLIVANNLFKIFLNTLIREEKNND 



CPS2J 



DNA Serotype 2 SEQ ID NO:21 
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MINISIIVPI YNVEQYLSKC INSIVNQTYK HIEILLVNDG STDNSEEICL AYAKKDSRIR 
YFKKENGGLS DARNYGISRA KGDYLAFIDS DDFIHSEFIQ RLHEAIEREN 
ALVAVAGYDR VDASGHFLTA EPLPTNQAVL SGRNVCKKLL EADGHRFWA WNKLYKKELF 
EDFRFEKGKI HEDEYFTYRL LYELEKVAIV KECLYYYVDR ENSIITSSMT 
DHRFHCLLEF QNERMDFYES RGDKELLLEC YRSFLAFAVL FLGKYNHWLS KQQKKLLQTL 
FRIVYKQLKQ NKRLALLMNA YYLVGCLHLN FSVFLKTGKD KIQERLRRSE 
SSTR 



CPS2K 



DNA Serotype 2 SEQ ID NO:22 



FIG. 3V 
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MSKKSIWSG LVYTIGTILV 
LGGAFGPGWV HFREKFDDFV 
LPDWWPLIF LQSLMIWQG 
FIARVMANPA TTGVLACVSX 
LGHNVLNQFD RIMLGKMLTL 
DLLSYVRYYL AIGLFVTFGF 
FFVFLYSFPA NIQFYSGNTK 
VLHYFVAKKK YAYDEVAIST 
AVLWYAYIF RKELTVALNT 



DNA Serotype 2 



QGLAFITLPI YTRVISQEVY 
STLMVSSIAF FLPIFGLSFL 
FFTTYLVQRQ QSMWTLPLSV 
WFSQKKNGLH FRKDYLRYGL 
SDVALYSFGY TLASILQIVF 
LTIYPELAML LGGSEYRFSM 
FLPIGTFIAG VLNISVHFVL 
FVKVIALVW YTGLMTVFVG 
FREKRSK 



GQFSLYNSWV GLVGLFIGLQ 
LSQPLSLLFG 

LSAVINTALS LFLTFPMEND 
SISIPLIFHG 

SSLNTVWCPW YFEKKRGADK 
GFIPMIIVGV 

IPTKNLWCCF ATTASYLLLL 
SIWIRWSLGI 



CPS 20 



SEQ ID NO:23 
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MVYIIAEIGC 
ESDSQLEMTR 
MPVYKIPSGE 
TEYPTPYPAL 
EKHFTLDNEM 
VAKKAIAKGE 
HSAFENQM 



NHNGDVHLAR 
RLELSFEEYL 
ITNLPYLEKI 
NLNVLHTLKK 
EGPDHKASAT 
VFTEENITVK 



KMVEVAVDCG 
DLRDYCLEKG 
GRQAKKVILS 
EFPNLTIGYS 
PDILAALVKG 
RPGNGISPME 



VDAVKFQTFK 
VDVFSTPFDE 
TGMAVMDEIH 
DHSVGSEVPI 
VRIVEQSLGK 
WYKVLGQVSE 



ADLLI SKYAP KAEYQKITTG 
ESLDFLISTD 

QAVKILQENG TTDISILHCT 
AAAAMGAELI 

FEKEPEEVEV RNKIVARKSI 
QDFEEDQNIC 



CPS2P 



DNA Serotype 2 



SEQ ID NO:24 
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MKKICFVTGS RAEYGIMRRL LSYLQDDPEM ELDLVVTAMH LEEKYGMTVK DIEADKRRIV 
KRIPLHLTDT SKQTIVKSLA TLTEQLTVLF EEVQYDLVLI LGDRYEMLPV 
ANAALLYNIP ICHIHGGEKT MGNFDESIRH AITKMSHLHL TSTDEFRNRV IQLGENPTMY 



CPS2Q 



DNA Serotype 2 SEQ ID NO:25 
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MELGIDFAED YYWLFHFVT LEDNTAEEQT QALLDALKEO GSQCLIIGSN SDTHADKIME 

LMHEFVKQDS DSYIFTSLPT RYYHSLVKHS QGLIGNSSSG LIEVPSLQVP 

TLNIGNRQFG RLSGPSWHV GTSKEAIVGG LGQLRDVIDF TNPFEQPDSA LQGYRAIKEF 
LSVQASTMKE FYDR 



CPS2R 



DNA Serotype 2 



SEQ ID NO:26 
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MKKVAFLGAG TFSDGVLPWL DRTRYELIGY FEDKPISDYR GYPVFGPLQD VLTYLDDGKV 
DAVFVTIGDN VKRKEI FDLL AKDHYDALFN IISEQANIFS PDSIKGRGVF 
IGFSSFVGAD SYVYDNCIIN TGAIVEHHTT VEAHCNITPG VTINGLCRIG ESTYIGSGST 
VIQCIEIAPY TTLGAGTWL KSLTESGTYV GVPARKIK 



CPS2S 



DNA Serotype 2 



SEQ ID NO:27 
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MEPICLIPAR 
CETTGVQVLM 
GKHVKEAMEL 
YPNGAIYISS 
IYFDYQRREQ 
LENQGLFLAT 
VFVTT IAYTL 
HFNQIGQERV 



SGSKGLPNKN 
RPADLATDFT 
YGKGQADHW 
KQAYLADKTY 
QNKPFYKREL 
PIKKVLLSLG 
FRDSVSNEEI 
NQLILTSLTR 



MLFLDGVPMI 
TSFQLNEHFL 
SFTKVDKSPT 
FSEKTAAYVM 
KRLCEQRVHD 
VNDLITDYPL 
VQLNDVIVQS 



FHTIRAAIES 
QDFSDDQVFV 
LFSTLDENGF 
TKEDSIDVDD 
SLVIGDSRLL 
HMIEDTIRQL 
ASELGISVID 



GCFKKENIYV 
LLQVTSPLRS 
AKDIAGLGGS 
HFDFTGVIGR 
ALLLDGFDNI 
MESLVSKAEQ 
LNEWEKEAM 



STDSEVYKEI 
YRRQDEKTLY 
SIGGMTASTA 
LDYQYTNDGL 

CPS2T 



DNA Serotype 2 



SEQ ID NO:28 
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ATCGCCAAAC GAAATTGGCA TTATTTGATA TGATAGCAGT TGCAATTTCT GCAATCTTAA CAAGTCATAT 
ACCAAATGCT GATTTAAATC GTTCTGGAAT TTTTATCATA 

ATGATGGTTC ATTATTTTGC ATT TTT TATA TCTCGTATGC CAGTTGAATT T GAGT AT AG A GGTAATCTGA 
TAGAGTTTGA AAAAACATTT AACTATAGTA TAATATTTGC 

AATTTTTCTT ACGGCAGTAT CATTTTTGTT GGAGAATAAT TTCGCACTTT CAAGACGTGG TGCCGTGTAT 
TTCACATTAA TAAACTTCGT TTTGGTATAC CTATTTAACG 

TAATTATTAA GCAGTTTAAG GATAGCTTTC TATTTTCGA'C AATCTATCAA AAAAAGACGA TTCTAATTAC 
AACGGCTGAA CGATGGGAAA ATATGCAAGT TTTATTTGAA 

TCACATAAAC AAATTCAAAA AAATCTTGTT GCATTGGTAG TTTTAGGTAC AGAAATAGAT AAAATTAATT 
TATCATTACC GCTCTATTAT TCTGTGGAAG AAGCTATAGA 

GTTTTCAACA AGGGAAGTGG TCGACCACGT CTTTATAAAT CTACCAAGTG AGTTTTTAGA CGTAAAGCAA 
TTCGTTTCAG ATTTTGAGTT GTTAGGTATT GATGTAAGCG 

TTGATATTAA TTCATTCGGT TTTACTGCGT TGAAAAACAA AAAAATCCAA CTGCTAGGTG ACCATAGCAT 
TGTAACTTTT TCCACAAATT TTTATAAGCC TAGTCATATC 

ATGATGAAAC GACTTTTGGA TATACTCGGA GCGGTAGTCG GGTTAATTAT TTGTGGTATA GTTTCTATTT 
TGTTAGTTCC AATTATTCGT AGAGATGGTG GACCGGCTAT 

TTTTGCTCAG AAACGAGTTG GACAGAATGG ACGCATATTT ACATTCTACA AGTTTCGATC GATGTATGTT 
GATGCTGAGG AGCGCAAAAA AGACTTGCTC AGCCAAAACC 

AG AT GCAAGG GTGGGTATGT TTTAAAATGG GAAAAACGAT CCTAGAATTA CTCCAATTGG ACATTTCATA 
CGCAAAAACA AGTTTAGACG AGTTACCACA GTTTTATAAT 

GTTTTAATTG GCGATATGAG TCTAGTTGGT ACACGTCCAC CTACAGTTGA TGAATTTGAA AAATATACTC 
CTGGTCAAAA GAGACGATTG AGTTTTAAAC CAGGGATTAC 

AGGTCTCTGG CAGGT TAG TG GTCGTAGTAA TATCACAGAC TTCGACGACG TAGTTCGGTT GGACTTAGCA 
TACATTGATA ATTGGACTAT CTGGTCAGAT ATTAAAATTT 

TATTAAAGAC AGTGAAAGTT GTATTGTTGA GAGAGGGAAG TAAGTAAAAG TATATGAAAG TTTGTTTGGT 
CGGTTCTTCA GGGGGACATT TGACTCACTT GTATTTGTTA 

AAACCGTTTT GGAAGGAAGA AGAACGTTTT TGGGTAACAT TTGATAAAGA GGATGCAAGA AGTCTTTTGA 
AGAATGAAAA AATGTATCCA TGTTACTTTC CAACAAATCG 

CAATCTCATT AATTTAGTGA AAAATACTTT CTTAGCTTTC AAAATTTTAC GTGATGAGAA ACCAGATGTT 
ATTATTTCAT CTGGTGCGGC CGTTGCTGTC CCCTTCTTTT 

ACATCGGAAA ACTATTTGGA GCAAAGACGA TTTATATTGA AGTATTTGAT CGAGTTAATA AATCTACATT 
AACTGGAAAA CTAGTTTATC CCGTAACAGA TATTTTTATT 

GTTCAGTGGG AAGAAATGAA GAAGGTATAT CCTAAATCTA TTAACTTGGG GAGTATTTTT TAATGATTTT 
TGTAACAGTA GGAACTCATG AACAACAGTT TAATCGATTG 

ATAAAAGAGA TTGATTTATT GAAAAAAAAT GGAAGTATAA CCGACGAAAT AT T TATTCAA ACAGGATATT 
CTGACTATAT TCCAGAATAT TGCAAGTATA AAAAATTTCT 

CAGTTACAAA GAAATGGAAC AATATATTAA CAAATCAGAA GTAGTTATTT GCCACGGAGG CCCCGCTACT 
TTTATGAATT CATTATCCAA AGGAAAAAAA CAATTATTGT 

TTCCTAGACA AAAAAAGTAT GGTGAACATG TAAATGATCA TCAAGTAGAG TTTGTAAGAA GAATTTTACA 
AGATAATAAT ATTTTATTTA TAGAAAATAT AGATGATTTG 

TTTGAAAAAA TTATTGAAGT TTCTAAGCAA ACTAACTTTA CATCAAATAA TAATTTTTTT TGTGAAAGAT 
TAAAACAAAT AGTTGAAAAA TTTAATGAGG ATCAAGAAAA 

TGAATAATAA AAAAGATGCA TATTTGATAA TGGCTTATCA TAATTTTTCT CAGATTTTAC TGGAGAGGGA 
TACAGATATT ATCATCTTCT CTCAGGAGAA TGCACACCAT 

TAGTTCCTTC AGAATACCTG TATAATTATT TTAAATATTC TCAGGATTTA TATGTTGAAT TTACAAAAGA 
TGAGCAAAAA TATAAAGAAA ATAGGATATA TGAACGAGTT 

AAATGTTACA GATTATTTCC TAAT AT AT C A GAAAAAACTA TTGATAATGT ACTGTTTAGA ATTTTATTAA 
GAATGTATCG AGCTTTTGAA TACTATTTAC AAAGATTGTT 

GTTTATTGAT AGAATAAAAA ACATGGTCTA AGAATAAGAT TTGGTTCTAA TTGGGTTTCG CTTCCACATG 
ATTTTGTGGC AATTCTTTTA TCAAATGAAA ACGAAACAGC 

TTATTTATTT AAGTAATCTA AATGTCCAGA TGAACTATTT ATACAGACAA TTATAGAAAA ATATGAATTT 
TCAAATAGAT TATCTAAATA TGGAAATTTA AGATATATAA 

AGTGGAAAAA ATCAACATCT TCTCCTATTG TCTTTACAGA TGATTCTATT GATGAATTGC TAAATGCAAG 
AAATTTAGGT TTTTTATTTG CTAGAAAGTT AAAAATAGAA 

AATAAAT CTA AATTTAAAGA AATTATTACT AAAAAATAAA ATAGTTGATT TTGTGAGAGT AATGTATGTT 
TAAATTATTT AAATATGACC CGGAATATTT TATTTTTAAG 

TACTTCTGGT TGATTATTTT TATTCCAGAG CAAAAGT AT G TATTTTTATT AATTTTTATG AATTTAATTT 
TATTTCATAT AAAATTTTTG AAAACTAAGC TAAT AT T AAA 

AAATGAAATT TTATTGTTTT TATTATGGTC TATATTATGT TTTGTTTCAG TAGTCACAAG TATGTTTGTT 
GAAATAAATT TTGAAAGATT AT TT GCAGAT TTTACTGCTC 

CCATAATTTG GATTATTGCA ATAATGTATT ATAATTTGTA TT CAT T TATA AATATTGATT ATAAAAAATT 
AAAAAATAGT ATCTTTTTTA GTTTTTTAGT TTTATTAGGT 

ATATCTGCAT TGTATATTAT TCAAAATGGG AAAGATATTG TATTTTTAGA CAGACACCTT AT AGG ACT AG 
ACTATCTTAT AACAGGCGTC AAAACAAGGT TGGTTGGCTT 

TATGAACTAT CCTACGTTAA ATACCACTAC AATTATAGTT TCAATTCCGT TAATCTTTGC ACT TAT AAAA 
AATAAAATGC AACAATTTTT TTTCTTGTGT CTTGCTTTTA 

DNA Serotype 1 FIG. 4A 
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TACCGATCTA TTTAAGTGGA TCGAGAATTG GTAGTTTATC GCTAGCAATA TTAATTATAT GCTTGTTATG 
GAGATATATA GGTGGAAAAT TTGCTTGGAT AAAAAAGCTA 

ATAGTAATAT TTGTAATACT ACTTATTATT TTAAATACTG AATTGCTTTA CCATGAAATT TTGGCTGTTT 
ATAATTCTAG AGAATCAAGT AACGAAGCTA GATTTATTAT 

TTATCAAGGA AGTATTGATA AAGTATTAGA AAACAATATT TTATTTGGAT ATGGAATATC CGAATATTCA 
GTTACGGGAA CTTGGCTCGG AAGTCATTCA GGCTATATAT 

CATTTTTTTA TAAATCAGGA ATAGTTGGGT TGATTTTACT GATGTTTTCT TTTTTTTATG TTATAAAAAA 
AAGTTATGGA GTTAATGGGG AAACAGCACT ATTTTATTTT 

ACATCATTAG CCATATTTTT CATATATGAA ACAATAGATC CGATTATTAT TATATTAGTA CTATTCTTTT 
CTTCAATAGG TATTTGGAAT AATATAAATT TTAAAAAGGA 

TAT GGAGACA AAAAATGAAT GATTTAATTT CAGTTATTGT ACCAATT TAT AATGTCCAAG ATTATCTTGA 
TAAATGTATT AACAGTATTA T TAACCAAAC ATATACTAAT 

TTAGAGGTTA TTCTCGTAAA TGATGGAAGT ACTGATGATT CTGAGAAAAT TTGCTTAAAC TATATGAAGA 
ACGATGGAAG AATTAAATAT TACAAGAAAA TTAATGGCGG 

TCTAGCAGAT GCTCGAAATT TCGGACTAGA ACATGCAACA GGTAAATATA TTGCTTTTGT CGATTCTGAT 
G AC TAT AT AG AAGTTGCAAT GTTCGAGAGA ATGCATGATA 

AT AT AACT G A GTATAATGCC GATATAGCAG AGATAGATTT TTGTTTAGTA GACGAAAACG GGTATACAAA 
GAAAAAAAGA AATAGTAATT TTCATGTCTT AACGAGAGAA 

GAGACTGTAA AAGAATTTTT GTCAGGATCT AATATAGAAA ATAATGTTTG GTGCAAGCTT TATTCACGAG 
AT AT T ATAAA AGATATAAAA TTCCAAATTA ATAATAGAAG 

TATTGGTGAG GATTTGCTTT TTAATTTGGA GGTCTTGAAC AATGTAACAC GTGTAGTAGT TGATACTAGA 
GAATATTATT ATAATTATGT CATTCGTAAC AGTTCGCTTA 

TTAATCAGAA ATTCTCTATA AATAATATTG ATTTAGTCAC AAGATTGGAG AATTACCCCT TTAAGTTAAA 
AAGAGAGTTT AGTCATTATT TTGATGCAAA AG TTAT T AAA 

GAGAAGGTTA AATGTTTAAA CAAAATGTAT TCAACAGATT GTTTGGATAA TGAGTTCTTG CCAATATTAG 
AGTCTTATCG AAAAGAAATA CGTAGATATC CATTTATTAA 

AGCGAAAAGA TATTTATCAA GAAAGCATTT AGTTACGTTG TATTTGATGA AATTTTCGCC TAAACTATAT 
GTAATGTTAT ATAAGAAATT TCAAAAGCAG TAGAGGTAAA 

AATGGATAAA ATTAGTGTTA TTGTTCCAGT TTATAATGTA GATAAATATT TAAGTAGTTG TATAGAAAGC 
ATTATTAATC AAAATTATAA AAATATAGAA AT AT T ATTGA 

TAGATGATGG CTCTGTAGAT GATTCTGCTA AAATATGCAA GGAATAT GC A GAAAAAGATA AAAGAGTAAA 
AATTTTTTTC ACTAATCATA GTGGAGTATC AAAT GCTAGA 

AAT CATGGAA TAAAGCGGAG TACAGCTGAA TATATTATGT TTGTTGACTC TGATGATGTT GTTGATAGTA 
GATTAGTAGA AAAATTATAT TTTAATATTA TAAAAAGTAG 

AAGTGATTTA TCTGGTTGTT TGTACGCTAC TTTTTCAGAA AATATAAATA ATTTTGAAGT GAATAATCCA 
AATATTGATT TTGAAGCAAT TAATACCGTG CAGGACATGG 

GAGAAAAAAA TTTTATGAAT TTGTATATAA ATAATATTTT TTCTACTCCT GTTTGTAAAC TATATAAGAA 
AAGATACATA ACAGATCTTT TTCAAGAGAA TCAATGGTTA 

GGAGAAGATT TACTTTTTAA TCTGCATTAT TTAAAGAATA TAGATAGAGT TAGTTATTTG AC TGAACAT C 
TTTATTTTTA TAGGAGAGGT ATACTAAGTA CAGTAAATTC 

TTTTAAAGAA GGTGTGTTTT TGCAATTGGA AAATTTGCAA AAACAAG TG A TAGTATTGTT TAAGCAAATA 
TATGGTGAGG ATTTTGACGT ATCAATTGTT AAAGATACTA 

TACGTTGGCA AGTATTTTAT TATAGCTTAC TAATGTTTAA ATACGGAAAA CAGTCTATTT TTGACAAATT 
TTTAATTTTT AGAAATCTTT ATAAAAAATA TTATTTTAAC 

TTGTTAAAAG TATCTAACAA AAATTCTTTG TCTAAAAATT TTTGTATAAG AATTGTTTCG AACAAAGTTT 
TTAAAAAAAT ATTATGGTTA TAATAGGAAG AT ATCAT GGA 

T ACT ATT AG T AAAATTTCTA TAATTGTACC TATATATAAT GTAGAAAAAT ATTTATCTAA ATGTATAGAT 
AGCATTGTAA ATCAGACCTA CAAACATATA GAGATTCTTC 

TGGTGAATGA CGGTAGTACG GATAATTCGG AAGAAATTTG TTTAGCATAT GCGAAGAAAG ATAGTCGCAT 
TCGTTATTTT AAAAAAGAGA ACGGCGGGCT ATCAGATGCC 

CGTAATTATG GCATAAGTCG CGCCAAGGGT GACTACTTAG CTTTTATAGA CTCAGATGAT TTTATTCATT 
CGGAGTTCAT CCAACGTTTA CACGAAGCAA TTGAGAGAGA 

GAATGCCCTT GTGGCAGTTG CTGGTTATGA TAGGGTAGAT GCTTCGGGGC ATTTCTTAAC AGCAGAGCCG 
CTTCCTACAA ATCAGGCTGT TCTGAGCGGC AGGAATGTTT 

GTAAAAAGCT GCTAGAGGCG GATGGTCATC GCTTTGTGGT GGCCTGTAAT AAACTCTATA AAAAAGAACT 
ATTTGAAGAT TTTCGATTTG AAAAGGGTAA GAT TCATGAA 

GATGAATACT TCACTTATCG CTTGCTCTAT GAG TT AG AAA AAGTTGCAAT AGTTAAGGAG TGCTTGTACT 
ATTATGTTGA CCGAGAAAAT AGTATCACAA CTTCTAGCAT 

GACT GACCAT CGCTTCCATT GCCTACTGGA ATTTCAAAAT GAACGAATGG ACTTCTATGA AAGTAGAGGA 
GATAAAGAGC TCTTACTAGA GTGTTATCGT TCATTTTTAG 

CCTTTGCTGT TTTGTTTTTA GGCAAATATA ATCAT TGGTT GAGCAAACAG CAAAAGAAGC TT 
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RQTKLALFDM IAVAISAILT 
EFEKTFNYSI IFAIFLTAVS 
IIKQFKDSFL FSTIYQKKTI 
SLPLYYSVEE AIEFSTREW 
DINSFGFTAL KNKKIQLLGD 
LVPIIRRDGG PAI FAQKRVG 
MQGWVC FKMG KTILELLQLD 
GQKRRLSFKP GITGLWQVSG 
LKTVKWLLR EGSK 



SHIPNADLNR SGIFIIMMVH 
FLLENNFALS RRGAVYFTLI 
L2TTAERWEN MQVLFESRKQ 
DHVFINLPSE FLDVKQFVSD 
HSIVTFSTNF YKPSHIMMKR 
QNGRIFTFYK FRSMYVDAEE 
ISYAKTSLDE LPQFYNVLIG 
RSNITDFDDV VRLDLAYIDN 



YFAFFISRMP VEFEYRGNLI 
NFVLVYLFNV 

IQKtJLVRLW LGTEIDKINL 
FELLGIDVSV 

LLDILGAWG LIICGIVSIL 
RKKDLLSQNQ 

DMSLVGTRPP TVDEFEKYTP 
WTIWSDIKIL 



CPS1E 



DNA Serotype 1 SEQ ID NO:30 
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MKVCLVGSSG GHLTHLYLLK PFWKEEERFW VTFDKEDARS LLKNEKMYPC YFPTNRNLIN 
LVKNT FLAFK ILRDEKPDVI ISSGAAVAVP FFYIGKLFGA KTIYIEVFDR 
VNKSTLTGKL VYPVTDIFIV QWEEMKKVYP KSINLGSIF 



CPS1F 

DNA Serotype 1 SEQ ID NO:31 
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MIFVTVGTHE QQFNRLIKEI DLLKKNGSIT DEIFIQTGYS DYIPEYCKYK KFLSYKEMEQ 
YINKSEWIC HGGPATFMNS LSKGKKQLLF PRQKKYGEHV NDHQVEFVRR 
ILQDNNILFI ENIDDLFEKI IEVSKQTNFT SNNNFFCERL KQI VEKFNE D QENE 

CPS1G 
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SEQ ID NO:32 
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MFKLFKYDPE YFIFKYFWLI IFIPEQKYVF LLIFMNLILF HIKFLKTKLI LKNEILLFLL 
WSILCFVSW TSMFVEINFE RLFADFTAPI IWIIAIMYYN I.YSFINIDYK 
KLKNSIFFSF LVLLGISALY IIQNGKDIVF LDRHLIGLDY LI TGVKTRLV GFMNYPTLNT 
TTIIVSIPLI FALIKNKMQQ FFFLCLAFIP IYLSGSRIGS LSPLAILIIC 
LLWRYIGGKF AWIKKLIVIF VILLIILNTE LLYHEILAVY NSRESSNEAR FIIYQGSIDK 
VLENNILFGY GISEYSVTGT WLGSHSGYIS FFYKSGIVGL ILLMFSFFYV 
IKKSYGVNGE TALFYFTSLA IFFIYETIDP IIIILVLFFS SIGIWNNINF KKDMETKNE 

CPS1H 



DNA Serotype 1 SEQ ID NO:33 



FIG. 4F 
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MNDLISVIVP 
KYYKKINGGL 
NADIAEIDFC 
IKFQINNRSI 
SINNIDLVTR 
EIRRYPFIKA 



IYNVQDYLDK 
ADARNFGLEH 
LVDENGYTKK 
GEDLLFNLEV 
LENYPFKLKR 
KRYLSRKHLV 



CINSIINQTY 
ATGKYIAFVD 
KRNSNFHVLT 
LNNVTRVWD 
EFSHYFDAKV 
TLYLMKFSPK 



TNLEVILVND GSTDDSEKIC LNYMKNDGRI 
SDDYIEVAMF ERMHDNITEY 
REETVKEFLS GSNIENNVWC KLYSRDIIKD 
TREYYYNYVI RNSSLINQKF 
IKEKVKCLNK MYSTDCLDNE FLPILESYRK 
LYVMLYKKFQ KQ 



DNA Serotype 1 



CPS1I 
SEQ ID NO:34 



FIG. 4G 
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MDKISVIVPV 
FFTNHSGVSN 
DLSGCLYATF 
YITDLFQENQ 
KEGVFLQLEN 
IFRNLYKKYY 



YNVDKYLSSC 
ARNHGIKRST 
SENINNFEVN 
WLGEDLLFNL 
LQKQVIVLFK 
FNLLKVSNKN 



IESIINQNYK 
AEYIMFVDSD 
NPNIDFEAIN 
HYLKNIDRVS 
QIYGEDFDVS 
SLSKNFCIRI 



NIEILLIDDG SVDDSAKICK EYEKDKRVKI 
DWDSRLVEK LYFNIIKSRS 
TVQDMGEKNF MNLXXNNIFS TPVCXLYQKR 
YLTEHLYFYR RGILSTVNSF 
IVKDTIRWQV FYYSLLMFKY GKQSIFDKFL 
VSNKVFKKIL WL 



CPS1J 



DNA Serotype 1 



SEQ ID NO:35 



FIG. 4H 
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MDTISKISII VPIYNVEKYL SKCIDSIVNQ TYKHIEILLV NDGSTDNSEE ICLAYAKKDS 
RIRYFKKENG GLSDARNYGI SRAKGDYLAF IDSDDFIHSE FIQRLHEAIE 
RENALVAVAG YDRVDASGHF LTAEPLPTNQ AVLSGRNVCK KLLEADGHRF WACNKLYKK 
ELFEDFRFEK GKIHEDEYFT YRLLYELEKV AIVKECLYYY VDRENSITTS 
SMTDHRFHCL LEFQNERMDF YESRGDKELL LECYRSFLAF AVLFLGKYNH WLSKQQKK 

CPS1K 



DNA Serotype 1 SEQ ID NO:36 



FIG. 41 
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AAGCTTATCG TCAAGGTGTT CGCTATATCG TGGCGACATC T C ATAGACG A AAAGGGATGT 
TTGAAACACC AGAAAAAGTT ATCATGACTA ACTTTCTTCA AT T T AAAG AC 
GCAGTAGCAG AAGTTTATCC TGAAATACGA TTGTGCTATG GTGCTGAATT GTATTATAGT 
AAAGATATAT TAAGCAAACT TGAAAAAAAG AAAGTACCCA CACTTAATGG 
CTCGCGCTAT ATTCTTTTGG AGTTCAGTAG TGATACTCCT TGGAAAGAGA TTCAAGAAGC 
AG TGAACGAA GTGACGCTAC TTGGGCTAAC TCCCGTACTT GCCCATATAG 
AACGATATGA CGCCCTAGCG TTTCATGCAG AGAGAGTAGA AGAGTTAATT GACAAGGGAT 
GCTATACTCA GGTAAATAGT AATCATGTGC TGAAGCCCAC TTTAATTGGT 
GAT CGAGC AA AAGAATTTAA AAAACGTACT CGGTATTTTT TAGAGCAGGA TTTAGTACAT 
TGTGTTGCTA GCGATATGCA TAATTTATCT AGTAGACCTC CGTTTATGAG 
GGAGGCTTAT AAGTTGCTAA CAGAGGAATT TGGCAAAGAT AAAGCGAAAG CGTTGCTAAA 
AAAGAATCCT CTTATGCTAT TAAAAAACCA GGCGATTTAA ACT GGT TACT 
CTAGATTGTG GAGAGAAAAA TGGATTTAGG AACTGTTACT GATAAACTGT TAG AACG C AA 
CAGTAAACGA TTGATACTCG TGTGCATGGA TACGTGTCTT CTTATAGTTT 
CCATGATTTT GAGCAGACTG TTTTTGGATG TTATTATTGA CATACCAGAT GAACGCTTCA 
TTCTTGCAGT TTTATTCGTA TCAATTTTAT ATTTGATTCT ATCGTTTAGA 
TTAAAAGTCT TTTCATTAAT TACGCGTTAC ACAGGGTATC AGAGTTATGT AAAAATAGGA 
CTTAGTTTAA TATCTGCGCA TTCATTGTTT TTAATTATCT CAATGGTGTT 
GTGGCAGGCT TTTAGTTATC GTTTCATCTT AGTATCCTTA TTTTTGTCGT ATGTAATGCT 
CATTACTCCG AGGATTGTTT GGAAAGTCTT ACATGAGACG AGAAAAAATG 
CTATCCGTAA GAAGGATAGC CCACTAAGAA TCTTAGTAGT AGGTGCTGGA GATGGTGGTA 
ATATTTTTAT CAATACTGTC AAAGATCGAA AATTGAATTT TGAAATTGTC 
GGTATCGTTG ATCGTGATCC AAATAAACTT GGAACATTTA TCCGTACGGC TAAAGTTTTA 
GGAAACCGTA ATGATATTCC ACGACTGGTA GAGGAATTAG CTGTTGACCA 
AGTGACGATT GCCATCCCTT CTTTAAATGG TAAGGAGCGA GAGAAGATTG TTGAAATCTG 
TAACACTACA GGAGTGACCG TCAATAATAT GCCGAGTATT G AAG AC AT T A 
TGGCGGGGAA CATGTCTGTC AGTGCCTTTC AGGAAATTGA CGTAGCAGAC CTTCTTGGTC 
GACCAGAGGT TGTTTTGGAT CAGGATGAAT TGAATCAGTT TTTCCAAGGG 
AAAACAATCC TTGTCACAGG AGCAGGTGGC TCTATCGGTT CAGAGCTATG TCGTCAAATT 
GCTAAGTTTA CGCCTAAACG CTTGTTGTTG CTTGGACATG GAGAAAATTC 
AATCTATCTC ATTCATCGAG AGTTACTGGA AAAG T ACCAA GGTAAGATTG AGTTGGTCCC 
TCTCATTGCA GATATTCAAG ATAGAGAATT GATTTTTAGC ATAATGGCTG 
AATAT CAACC CGATGTTGTT TATCATGCTG CAGCACATAA GCATGTTCCT TTGATGGAAT 
ATAATCCACA TGAAGCAGTG AAGAATAATA TTTTTGGAAC GAAGAATGTG 
GCTGAGGCGG CTAAAACTGC AAAGGTTGCC AAATTTGTTA TGGTTTCAAC AGATAAAGCT 
GTTAATCCAC CAAATGTCAT GGGAGCGACT AAACGTGTTG CAGAAATGAT 
TGTTACAGGT TTAAACGAGC CAGGTCAGAC TCAATTTGCG GCAGTCCGGT TTGGGAATGT 
TCTAGGTAGT CGTGGAAGTG TTGTTCCGCT ATTCAAAGAG CAAATTAGAA 
AAGGTGGACC TGTTACGGTT ACCGACTTTA GGATGACTCG TTATTTCATG ACGATTCCTG 
AGGCAAGTCG TTTGGTTATC CAAGCTGGAC ATTTGGCAAA AGGTGGAGAA 
ATATTTGTCT TGGATATGGG CGAGCCAGTA CAAATCCTGG AATTGGCAAG AAAAGTTATC 
TTGTTAAGTG GACACACAGA GGAAGAAATC GGGATTGTAG AATCTGGAAT 
CAGACCAGGC GAGAAACTCT ACGAGGAATT ATTATCAACA GAAGAACGTG TCAGCGAACA 
GATT CATGAA AAAATATTTG TGGGTCGCGT TACAAATAAG CAGTCGGACA 
TTGTCAATTC ATTTATCAAT GGATTACTCC AAAAAGATAG AAATGAATTA AAAAATATGT 
TGATTGAATT TGCAAAACAA GAATAAGAAA GTAAAAAATA TTTTTACTTT 
CCTAGAGTTT AAACGATGTT TAAGTTCTAG GAAGGTTAGA ATACCTAATT AACAACAATA 
TTACTATTTA TTAAGAGTCA GATAATAGCA ACTAAGTGCT ACAAACTATC 
TTTATAATAA GTATATTTGG TCAAAAGGGA GATGTGAAAT GTATCCAATT TGTAAACGTA 
TTTTAGCAAT TATTATCTCA GGGATTGCTA TTGTTGTTCT GAGTCCAATT 
TTATTATTGA TTGCATTGGC AATTAAATTA GATTCTAAAG GTCCGGTATT AT TTAAACAA 
AAGCGGGTTG GTAAAAACAA GTCATACTTT AT GATT TATA AATTCCGTTC 
TATGTACGTT GACGCACCAA GTGATATGCC GACTCATCTA TTAAAGGATC CTAAGGCGAT 
GATTACCAAG GTGGGCGCGT TTCTCAGAAA AACAAGTTTA GATGAACTGC 
CACAGCTTTT TAATATTTTT AAAGGTGAAA TGGCGATTGT TGGTCCACGC CCAGCCTTAT 
GGAATCAATA TGACTTAATT GAAGAGCGAG ATAAATATGG TGCAAATGAT 
ATTCGTCCTG GACTAACCGG TTGGGCTCAA ATTAATGGTC GTGATGAATT GGAAATTGAT 
GAAAAGTCAA AAT TAG AT GG ATATTATGTT CAAAATATGA GTCTAGGTTT 
GG AT AT T AAA TGTTTCTTAG GTACATTCCT CAGTGTAGCC AGAAGCGAAG GTGTTGTTGA 
AGGTGGAACA GGGCAGAAAG GAAAAGGATG AAATTTTCAG TATTAATGTC 
GGTCTATGAG AAAGAAAAAC CAGAGTTTCT TAGGGAATCT TTGGAAAGCA TCCTTGTCAA 
TCAAACAATG AT TCCAAC G G AGGTTGTCTT GG TAGAGGAT GGGCCACTCA 
ATCAGAGCTT ATATAGTATT TTAGAAGAAT TTAAAAGTCG ATTTTCATTT TTTAAAACGA 
TAGCCTTGGA AAAGAATTCG GGTTTAGGAA TTGCACTGAA TGAAGGTTTG 
AAACATTGTA ATTATGAGTG GGTTTGCACG AAATGG ATT C TGATGATGTT GCATATACAT 
ACACGTTTTG AAAAGC AAG T TAACTTTATA AAACAAAACC CGACTATAGA 

DNA Serotype 9 FIG. 5A 
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TATTGAGATA GATGAGTTCT TAAATTCTAC 
AACCCAGCAC GATGAAATAT TAAAGATGGC 
GCCACATGAC TGTAATGTTT AAAAAGAAAA 
TTCCGTACGT AGAAGATTAT TTCCTTTGGG 
TCGAAATTTG CAAACATTGA TGAAACACTA 
AATAGGAGGG GGAACAGAGA ACAAATTAAC 
ATTTATGTTA GCTCAAGGAA TTGTTACACC 
TAGGGTCTTT GTTTATATGC CAACTTGGAT 
AAATCTTAAG GAAATAGTAT GATTACAGTA 
ATAATAAAAC AGTTAGATTC AATTCGAAAT 
AGTTATTATT TGGGATGATT GCTCGACAGA 
AAAAAAATAT TCTTTGGATT CATGGGTTGT 
AGGGGCATTA TCAAACATTT ATAAATTTGA 
TTTCAGATCA AGATGATATT TGGGACTGTC 
CCAATCTTTG ACAGAGAAAA TGTATCAATG 
AACGGAAATA TTATCAGTAG CCCAGATACT 
CTCTCTAGA 



DNA Serotype 9 



TAGTGAAATA GTTTCTCATA AAAATGTTCC 
AAGGCGGGAG AAATCCATGT 
GTGTCGAGAG AGCAGGGGGG TATCAAACAC 
TGCGCATGAT TGCTTCAGGA 
GTTCTTGCAC GTGTTGGAAA TGGGATGTTC 
AGTTGGACAT TACTAATTGA 
ACTAGATGTA TTTATTAATC AAATTTACAT 
AAAGAAACTC ATTTATGGAA 
TTGATGGCTA CATATAATGG AAGCCCATTT 
CAAAGTGTAT CAGCAGACAA 
T GAT AC AAT A AAAATAATAA AAGATTATAT 
CTCTCAAAAT AAATCTAATC 
CAAAGTTAGT TCAGGAAGGA ATAGTCTTTT 
ATAAAATTGA GACAATGCTT 
GTGTTTTGCA AATCCAGATT GATTGATGAA 
TCGGATAGAA TCAATACGTA 



SEQ ID NO:37 



FIG. 5B 
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AYRQGVRYIV ATSHRRKGMF ETPEKVIMTN FLQFKDAVAE VYPEIRLCYG AELYYSKDIL 

SKLEKKKVPT LNGSRYILLE FSSDTPWKEI QEAVNEVTLL GLTPVLAHIE 

RYDALAFHAE RVEELIDKGC YTQVNSNHVL KPTLIGDRAK EFKKRTRYFL EQDLVHCVAS 

DMHNLSSRPP FMREAYKLLT EE FGKDKAKA LLKKNPLMLL KNQAI 



CPS9D 



DNA Serotype 9 



SEQ ID NO:38 



FIG. 5C 
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MDLGTVT DKL 
YLILSFRLKV 
RFILVSLFLS 
KLNFEIVGIV 
SLNGKEREKI 
LNQFFQGKTI 
ELLEKYQGKI 
IFGTKNVAEA 
PGQTQFAAVR 
HLAKGGEIFV 
YEELLSTEER 



LERNSKRLIL 
FSLITRYTGY 
YVMLITPRIV. 
DRDPNKLGTF 
VEICNTTGVT 
LVTGAGGSIG 
ELVPLIADIQ 
AKTAKVAKFV 
FGNVLGSRGS 
LDMGEPVQIL 
VSEQIHEKIF 



VCMDTCLLIV 
QSYVKIGLSL 
WKVLHETRKN 
IRTAKVLGNR 
VNNMPSIEDI 
SELCRQIAKF 
DRELIFSIMA 
MVSTDKAVNP 
WPLFKEQIR 
ELARKVILLS 
VGRVTNKQSD 



SMILSRLFLD 
ISAHSLFLII 
AIRKKDSPLR 
NDI PRLVEEL 
MAGNMSVSAF 
T PKRLLLLGH 
EYQPDWYHA 
PNVMGATKRV 
KGGPVTVTDF 
GHTEEEIGIV 
IVNSFINGLL 



VIIDIPDERF 
SMVLWQAFSY 
I LWGAGDGG 
AVDQVTIAIP 
QEI DVADLLG 
GENSIYLIHR 
AAHKHVPLME 
AEMIVTGLNE 
RMTRYFMTIP 
ESGIRPGEKL 
QKDRNELKNM 



ILAVLFVSIL 
NIFINTVKDR 
RPEWLDQDE 
YNPHEAVKNN 
EASRLVIQAG 
LIEFAKQE 



CPS9E 



DNA Serotype 9 



SEQ ID NO:39 



FIG. 5D 
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MYPICKRILA IIISGIAIW LSPIL^LIAL AIKLDSKGPV LFKQKRVGKN KSYFMIYKFR 
SMYVDAPSDM PTHLLKDPKA MITKVGAFLR KTSLDELPQL FNIFKGEMAI 
VGPRPALWNQ YDLIEERDKY GANDIRPGLT GWAQINGRDE LEIDEKSKLD GYYVQNMSLG 
LDIKCFLGTF LSVARSEGW EGGTGQKGKG 



CPS9F 

DNA Serotype 9 SEQ ID NO:40 



FIG. 5E 
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MKFSVLMSVY EKEKPEFLRE SLESILVNQT MIPTEWLVE DGPLNQSLYS I LEE FKSRFS 
FFKTIALEKN SGLGIALNEG LKHCNYEWVC TKWILMMLHI HTRFEKQVNF 
IKQNPTIDIE IDEFLNSTSE IVSHKNVPTQ HDEILKMARR EKSMCHMTVM FKKKSVERAG 
GYQTLPYVED YFLWVRMIAS GSKFANIDET LVLARVGNGM FNRRGNREQI 
NSWTLLIEFM LAQGIVTPLD VFINQIYIRV FVYMPTWIKK LIYGKILRK 



CPS9G 

DNA Serotype 9 SEQ ID NO:41 



FIG. 5F 
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MITVLMATYN GSPFIIKQLD SIRNQSVSAD KVIIWDDCST DDTIKIIKDY IKKYSLDSWV 
VSQNKSNQGH YQTFINLTKL VQEGIVFFSD QDDIWDCHKI ETMLPIFDRE 
NVSMVFCKSR LIDENGNIIS SPDTSDRINT YSL 



CPS9H 



DNA Serotype 9 SEQ ID NO:42 



FIG. 5G 



CTGCAGCACA TAAGCATGTT 
ATATTTTTGG AACGAAGAAT 
GCCAAATTTG TTATGGTTTC 
ACTAAACGTG TTGCAGAAAT 
GACTCAATTT GCGGCAGTCC 
GCTATTCAAA GAGCAAATTA 
TTAGGATGAC TCGTTATTTC 
GACATTTGGC AAAAGGTGGA 
GTACAAATCC TGGAATTGGC 
ATCGGGATTG TAGAATCTGG 
ATTGTTATCA ACAGAAGAAC 
CGTTACAAAT AAGCAGTCGG 
T C C AAAAAG A TAGAAATGAA 
AAAGTAAAAA ATATTTTTAC 
TAGGAAGGTT GGAATTGCTT 
AAAGGATATT AAACTAAAGG 
GGTATTTAAT GCCAAACAGG 
AAGTTGCTTG AAAGAGAGTT 
AATATATATC TATTATTATC 
AAAATAATTT TTGGTGATTT 
G TACT TAT AT TCTTTTAGAA 
ACGAT GACT A GAGTAGAGTT 
AGCAACCAGT AAATATTTTC 
CTTTATGGAT AAGTTACTTG 
TAATC AT TAT ATTAGCTATT 
AAGAACGTGT TACGAGATAT 
AC AAT G ATT T CTGATGCGGA 
ATTACGAAAG TCGGTCACAT 
CCAACTTTTT AATGTTTTAA 
AAAAT AT GT A AATCAGTATA 
CTGCAGGAAT TACTTCACCA 
AATATTGTTC TCAAGGCTAT 
TTACCAGAAA AAATGAAGTA 
TTTAAAGTAA TGATTGATAC 
AATGACAAAA AGACAAAATA 
TGAAGTTATT GACACACTAA 
CAAAAGAGCT AGAACGTCGG 
ATTCTGCTAC TGCAGGATTG 
CCCGGAGATG AAGTTATTGT 
CAT GT AGGAG CAACTCCTGT 
GATGGAATAT GATGCTTTGG 
TGATCTAGCT GGTATTCCTT 
AAAACAAACG CTCTTTGTAT 
TTATTATCCT ATCTGATAGT 
AAACCAGCGG GTTCCCTAGC 
ACAACTGCTG AAGGAGGTAG 
TGACGAAGAG ATGTATAAAG 
ATTAGCTAAG ACACAATTAG 
GTTACAAGTG T AAT AT G AC A 
ACCCATCTTT GTTGAATCGT 
GGCTTTGAGG GGACTTCGAT 
ATGCACTTGT AT ATCAC GC A 
TGAAGTCATT CAAAAAATGG 
ACCTCTTCTC ACAGCCTACA 
CGAATGCCTA TCAGTATTTT 
AT GAAG AT GT GGAGTATGTG 
GATTAGTTAT TTTGGAAGGA 
TATCTATAAT AATGCCCTCG 
ATCCAGTCAG TGTTGGACCA 
TCTAATGACG AAACTGAAAA 
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CCATTGATGG AATATAATCC 
GTGGCTGAGG CGGCTAAAAC 
AACAGATAAA GCTGTTAATC 
GATTGTAACA GGTTTAAACG 
GTTTTGGGAA TGTTCTAGGT 
GAAAAGGTGG ACCTGTTACG 
ATGACGATTC CTGAGGCAAG 
GAAATCTTTG TCTTGGATAT 
AAGAAAAGTT ATCTTGTTAA 
AATCAGACCA GGCGAGAAAC 
GTGTCAGCGA ACAGATTCAT 
ACATTGTCAA TTCATTTATC 
TTAAAAGATA TGTTGATTGA 
TTTCCTAGAG TTTAAACGAT 
TCGTGGAGGT GATAGATAGA 
TGAATCGGAA CATAAAGTTT 
TGAATGCAAC CTCTCGCTCG 
TGTTAATCAG TATAAGTAGG 
GGTAATGATA CTATTATTGA 
TATCGTCCGA CTTAAAGGTG 
TTGATGAAAA ATATGGGGGA 
GATTACTAGA GAATTTTTTA 
AGAAGATAGA ATCAAGAAGA 
CGCTTATCCT ATTATTGCTA 
TGGATAAAAT TAGATAG TAA 
GGTCGAATTT T TAG AAT AT T 
TAAAGTCGGA AGTCTTGTCA 
TATCAGAAAA TATCGGCTGG 
TGGGGGATAT GAGCTTTGTA 
CTGATGAAAT GTTTGCGACG 
GCGAGTATTG CATATAAGGA 
AGTCCTGATG AAGCAT AT GT 
CAATTTGGAA TAT AT CAGAA 
AGTAATTAAA G T AAT AAAAT 
TTCCATTTTC ACCACCAGAT 
AATCTGGTTG GAT T ACAACA 
CTATCAGTAT TTACAGGAAC 
GAACTAGTCT TACGAATTCT 
TCCTGCTATG ACCTATACTG 
GATGGTTGAT ATTCAAAAAA 
AAAAAGCGAT TACTCCGAAA 
GTGATTATGA TAAGATTTAT 
GTTGCTTCTG ATAATAAATG 
GCACACTCAC TAGGTGCTAG 
AGATTTTACC TCATTTTCTT 
TGTGACATGG AG AT C ACATC 
AGTTTCAGAT TTACTCTCTT 
GGTCATGGGA AT AT G AC ATT 
GAT AT TAT GG CAGGTATCGG 
CGCAGAGAAA TC AT TGAGAA 
TAAGCCGTTG GTACACCTGA 
TCTACAAGGC TATACTTTAG 
CTGAAGCAGG TATTGCGTGC 
AGAATCTTGG TTTTGAAATG 
GAAAATGAAG TTACACTGCC 
ATAGAAATGT TTTTAAAAAT 
GATATGGTGG AAAGAGATAT 
TGGAATACAG CTAAGTATAT 
AACACACCAA AATTGGGAAC 
AGTTGTTTCG CATTTCAAAG 



ACAT GAAGC A GTGAAGAATA 
TGCAAAGGTT 

CGCCAAATGT CATGGGAGCG 
AGCCAGGTCA 

AGTCGTGGAA GTGTTGTTCC 
GTTACCGACT 

TCGTTTGGTT ATCCAAGCTG 
GGGTGAGCCA 

GCGGACATAC AGAGGAAGAA 
TCTACGAGGA 

GAAAAAATAT TTGTGGGTCG 
AATGGATTAC 

ATTTGCAAAA CAAGAATAAG 
GTTTAAGTTC 

AACCTATATA TTTGTAGAAG 
AGATAGAGTT 

T TACT AAGCA GGAGATAGTA 
CTAAAGTGAG 

GAATTATTGT AGTGGGGATA 
GGTTAAAAAA 

ATATAATATT TATAGGAGAT 
AGAAGAATGA 

G G T G AAT TAT TTATTAAATT 
TTATCCCCAG 

GGGGCCAATT TTTTATCGCC 
TAAGTTTAGA 

CAGTCGGTCA AGATAATCGT 
ACGAAGTGCC 

GGTGTAAGAC CAGAAGTACA 
TTACTTTTAC 

T GAAG AT ATT GTTTTAGAAG 
TCAAAAAGTA 

ACTTTGGAAT TATTTCTGAT 
AGGAGATTAA 

ATTACCCAAG CTGAAATTGA 
GGACCAAAGA 

CAATAAAACT GTGTGTTTAA 
TGGTGTTGGA 

CCTCATGTAG TGTCATTACT 
ACAGCTTTGA 

AC AAAAGT T A TCATTCCTGT 
ACCATCGTAG 

GCAGAAACTT TTTGGGCGAG 
T TAT AAGGGA 

TCCATGCAGT TAAGAATTTT 
CTGATTTGGA 

CATGGTCAGA CAAAGGATGC 
GTTATTCCTG 

TCTTGTGCAA TTAGAACGTT 
ATACAATGCT 

CGGAAGATAA ACAATCGTCT 
AACAACGAAA 

AATGTTCACT ACAAACCATT 
AAAGATTTTC 

TCTTCATACC AACTTGAGTG 
TGTTAGTAGA 

GGTGGAAAGA GACACGTTGG 
ATCTGAATCA 

TTATAATCGT TGATGATTGT 
ATTCAAGAAT 



DNA Serotype 7 



FIG. 6A 
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AAAGTTTTTT AAAAATTCGA 
AAAAGCTAGA GGTAGGTGGA 
ACCCGAGTAA GCTAGAAAAA 
ATCACAATTT TGAAAAGATT 
GTGTCAGGAC CAGCAATTGT 
TTGACTTTCA TGTATGATGC 
TATAAAGAAA AATAACGATT 
TCTTTTAAAT GAAAGTTTAG 



DNA Serotype 7 



ATAATTTAGG GGCAGCTCTA 
TTGCGTTCTT GGATTCAGAT 
CAGCTTGAAT TTATGAAAAA 
GATGAATCTA GTCAGTCTTT 
GACTAGAAAA ATGATGTACA 
AGACAAAATG GGTTTAATTC 
ATGCGATATT ACTTCAATTG 
CTTCGTATCG AATTAGAAAA 



ACACGAAATA AGGCACTAAG 
GATTTATGGC 

TAATGGATAT TCATTTACTT 
ACGTGTCCTG 

ATTACGGCTA TCCAGGGTGT 
AGATAAAAGA 

TGTAAGAAGT ATGACTGTTA 
AA 



SEQ ID NO:43 



FIG. 6B 
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AAHKHVPLME YNPHEAVKNN IFGTKNVAEA AKTAKVAKFV MVSTDKAVNP PNVMGATKRV 
AEMIVTGLNE PGQTQ FAAVR FGNVLGSRGS VVPLFKEQIR KGGPVTVTDF 
RMTRYFMTIP EASRLVIQAG HLAKGGEIFV LDMGEPVQIL ELARKVILLS GHTEEEIGIV 
ESGIRPGEKL YEELLSTEER VSEQIHEKIF VGRVTNKQSD IVNSFINGLL 
QKDRNELKDM LIEFAKQE 

CPS7E 

DNA Serotype 7 SEQ ID NO:44 



FIG. 6C 
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MTRVELITRE FFKKNEATSK 
IKLDSKGPIF YRQERVTRYG 
TKVGHIIRKY RLDEVPQLFN 
SIAYKDEDIV LEEYCSQGYS 
KVMIDTVIKV IK 



YFQKIESRRG ELFIKFFMDK 
RIFRIFKFRT MISDADKVGS 
VLMGDMS FVG VRPEVQKYVN 
PDEAYVQKVL PEKMECYNLEY 



LLALILLLLL SPVIIILAIW 
LVTVGQDNRI 

QYTDEMFATL LLPAGITSPA 
IRNFGIISDF 



CPS7F 



DNA Serotype 7 SEQ ID NO:45 



FIG. 6D 



4 
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MTKRQNIPFS PPDITQAEID EVIDTLKSGW ITTGPKTKEL ERRLSVFTGT NKTVCLNSAT 
AGLELVLRIL GVGPGDEVIV PAMTYTASCS VITHVGATPV MVDIQKNSFE 
MEYDALEKAI TPKTKVIIPV DLAGIPCDYD KIYTIVENKR SLYVASDNKW QKLFGRVIIL 
SDSAHSLGAS YKGKPAGSLA DFTSFSFHAV KNFTTAEGGS VTWRSHPDLD 
DEEMYKEFQI YSLHGQTKDA LAKTQLGSWE YDIVIPGYKC NMTDIMAGIG LVQLERYPSL 
LNRRREIIEK YNAGFEGTSI KPLVHLTEDK QSSMHLYITH LQGYTLEQRN 
EVIQKMAEAG IACNVHYKPL PLLTAYKNLG FEMKDFPNAY QYFENEVTLP LHTNLSDEDV 
EYVTEMFLKI VSRD 



CPS7G 



DNA Serotype 7 SEQ ID NO:46 



FIG. 6E 
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MVERDMVERD TLVSIIMPSW NTAKYISESI QSVLDQTHQN WELIIVDDCS NDETEKWSH 
FKDSRIKFFK NSNNLGAALT RNKALRKARG RWIAFLDSDD LWHPSKLEKQ 
LEFMKNNGYS FTYHNFEKID ESSQSLRVLV SGPAIVTRKM MYNYGYPGCL TFMYDADKMG 
LIQIKDIKKN NDYAILLQLC KKYDCYLLNE SLASYRIRK 



CPS7H 



DNA Serotype 7 SEQ ID NO:47 



FIG. 6F 
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